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AACACGGTTC CTGTGAAATT TTTCAAAGAA CAGGCTAGGA TGATGTCTAT GTCGGACCGA 
CGTGAGCAAT TTCAATACGC ATTTTTGGAT TCAGGTATAG GAGGATTGCC CTACGCACAC 
GCCTTACGCG TGCGTGTGCC TGAGGCCTCA CTGGTGTACG TGGCGGACCG TGTATACTTT 
CCTTATGGGA ATAAAAGTTC TGCACAGATT ATTGCGCGTG CGTCTGCAGT TTTGCAGAAA 
GTGCAGACGA ATTTTTCACC ACACATAGTG GTACTCGCGT GTAACAGCAT GTCTGTCAAT 
GCACTTGAGT TTTTGCGTGC GCAGGTTTCG GTTCCAGTGG TGGGGGTGGT GCCTGCAATT 
AAGCAGGCGG TGGCGTGCAG TCATAAAAAG CACATTGGTG TCTTAGCTAC ACAATGCACG 
ATTACGCATC CGTACACAGC GTGTTTGAGA GCACAGTTTG GTGCaGGGTG TGTGTTTCAG 
AATgcTGCGG ATGCACGCCT TATTGAGTGT CTTGAGCGCG GGTTAATTTT TGAAgTCgAA 
GACATGCaGC GGGAGGCAGT GGCGCGCTCA GTTATGCCCT TCCAGGAAGC GGGGGTGGAT 
GTGCTCGTGC TCGCGTGCAC CCATTTTGTG CACGTGCGTC ATCTTTTTCA GGACTGTGTT 
GGTACCTCGT GTACGGTGGT AGATTCGCTA GAAGGTGTGG TACGCAGGAC GTTACGTCTG 
TGTCCACCGC AATCTCAATT GCGTGGGAAC GCCGCCTGTT ACGTAACTGG TGCGCGCGAT 
GCAGTGTGCG CGGCACGATA CGCACGGTAT GCGCAGCACT TTGGATTGCG CTGGGCGGGT 
TTTTTGGaCk TATGAACACG GCACTGGATA TCGGGTGCGT GCACTGTGTG TGTTTGTGTG 
GAGGCGGTAG ATAAgAgAgG CTGATAgACA GCGCGGTGCT GCGTGCGTAC AATGGGCCAT 
GGGGAAGCCG AGgTtTCGTG CAGTGGCCTT TGACAT t GAT GGGAC Ac TGT ACCc TGGATG 
GCGCCTTGaT GCGTGTTAtG CCCTTtATGA TTCGCAATGC GCGCTTGATG CGTGCGTTCC 
GTGCGGTGCG TCAGGAGCTA CGTCGTGAGC AACGTACGGC ACTTATTCCT TTTGAAGACT 
TTTTTTTTGC GcAAsTACgC GCATCGCGCC GcGCGTGGGT TTATCTGCAG AAGAAGTGCG 
AGCCTTCCTC GACACAGCGC TGTATCGGGG GTGGAGGCGT CACTTTTTAC ATATAAAGCC 
ATTTCCTCAC GTGCTTTCCT CGGTGTTGGA GCTGAGGCGG CATGGGCTGA AGATAGCGCT 
TTTGTCGGAT TTTCCTCCGA GTCAGAAAGG CTGTCTATGG GGGGTGCGCG CGTTGTGCGA 
TGTAACGTTG GGCACAGAGG AGATTGGGTC CCTCAAGCCT TCTCCCCGGG CCTTTTACGC 
GcTGGCGCAG AG AC TGAATC TGcGCTGTGA AGAAATTCTT TACGTGGGGA ACAGTGTTCA 
TGACGTGGAA GGCGCGCACG CAGCAGGTAT GAGGATTGCC TGTGTGCGCA GgCCCTTTAC 
GAGTCTTCGC GTTCGGCGCA cGCGGaCTGG CTCTTTTCCG ACTATCGCAC ATTGTGcGCA 
TATGTGATAG CATGAGCGCC GGCGCAGGGT AGTCTGCCGA ACCCCACACG TCCAGCGTGG 
CGCCCGCGGG TACCCGCTGT GCGTCGCGTG AAGACGAAGt GAGTGGAGCA TGGAGTACTT 
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TCTGACGGTT GTCATTGCCT GCGCGATTTC CCTCGTGATG GTTGCGTTCT CCCGCCAGCT 
GGACAAGGGT AACCGTTCTC TTGAAAAGGT CAAGCGCTAC GCGkACTACA TAAAGGAAGA 
TCTTGAGTCA TcAGCGCAGA GAAGATTGCG ATGCTCAAGG ATGCGGCCAT CGAGTTAAAT 
GTAAAGCAAG AGCAGGCGAT TGCCTCAGTG AAAAAAATGG ATCACCTCTA CGACCAGTTT 
ATGAAgaAGT CTACTGCGCT TGCGGTGCAA A 
(2) INFORMATION FOR SEQ ID NO: 80: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1773 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80: 
GGAAAAAACC TTTAAATTCC CGGTACAATT GTGCAGAGAG CGTTTTATTA TGACTGATGA 
CAAGnGTCGG TTTTTGCAnG CGTGCAATAA CGTTTGCAAC TGTAAACGTT TTCCCCGAAC 
nGGTAACACC CTTGAGCGTT TGAAAACGCG CCCCCGCATG CAAGCCACGC ACGAGTGCGT 
CGATTGCGGC AATCTGATCG CCTGCAGGTT GGAAAGAAGC GTGTAATTTA AATTCTTTCA 
TGTGCTTCGC AGTATCGGTT ATTACCATAG GATTTAAAAA ATCTGTAGTA CATGCTGTGC 
CTTTCCCTTT GATTTTGCTC TCTTAGAACC TCTGGTTGAC AAACCACACC CATGGTGAAT 
AGAGTGGACC CCGCTTAAGA GGAGGCACGT TATGGTCAAG TTGCTTAGCA TTGGTGGATC 
GGATGCTTCA GGTGGTGCGG GCATCGAGGC GGATCTGAAA ACTTTCCAAG AGTATGGAGC 
GTTCGGTGTG GcTACGCTCA CCGCCATCGT TACTATGGAC CCATCCCGGA AC TGGTCGC A 
TCGTGTACAT TCACTTGAGG AAGACTGTGT GCGCGATCAG CTTGAAACCG CATTTGCAGG 
CGTGGGGGTC AGCGCGGTGA AAAGCGGTAT GCTTGCCTCT GTCCATGCAA TCGAATGTGT 
CGCGGAGTAT CTCGAACGTT TTGCAGTTGC TGCATACGTC TTTGATCCTG TCATGGTATG 
CAAAGGATCG GGAGATGCAT TGCACCGTGA GTTGAACGAA TTGATGATCC AGAAACTTTT 
GCCACGCGCG ACAGTTGTTA CTCCCAATCT TTTTGAAACC GCCCAGATTG CCGGTATCAG 
CGTACCACGG ACAGTGGACG AAATGAAGGA GGGTGCACGT TTGATTCACG AGCGCGGCGC 
GTCGCACGTG TTCGTCAAAG GCGGCGGAAG ACTCCCCGGT TGCAAGCACG CTCTGGATGT 
TTTCTACGAC GGCAAGACGT TTCACCTCGT TGAAGATGAA CTTGTGCAGA GTGGATGGAA 
TCACGGCGCG GGCTGCACCG TATCTGCGGC TATTACTGCA GGACTGGGCC GAGGACTCAC 
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CGCCTACGAC GCGATACTGA GTGCTAAGAG ATTCGTGACT ACAGGCCTCC GCCACGGATT 
CCAAGTCAAC CAGTGGGTTG GAACAGGAAA CCTCAGCAAA TGGCGCGACC GCTTCCACTG 
ACTCAGGCGG TACATACGTG GGCGATCAGT GCTGGTATAG GTGCTTGAAG TATTCCAAGT 
CGGTTGAGAG GATCTTCTCC GTGTCGGGGA GCGATTTCTT GTACACCTCC AAGCTTTTCC 
AGAAGCCGTA GAACTCAGGA GATTTCCCGT ACGACTGCGC GTACACGGCC GCGGCGCGGG 
CGTcTGCTTC ACCc TTGATA CGCTCTGCCT CCTCGTACGC TTTTGAAAGT AAAcTGCGTT 
TTTCGTTGTC GAGCTTTCCA aGCCACTCTG CCTTCTTTCC TTCGCCTGTG GAGCGGAACA 
TTTGCGCGAT CTGGTTGCGC TCTTTTACCA TCCGATTGAA CACAGATGCT TGCAGCTCAT 
CTGAGTACTT AATCCCCTTG AAGATCACAT CGACAACGAC AATACCGAAA TCTTTTAACT 
GATCATTCGC CGCC TGTGAG ATCTCCCGCG CAAGAGACTC TCGCCCCTTT TCTATCGTCA 
TATGCGCAGT TTTCTCCGCA CCCCTATCAA AGGCAAGCTG CGACACCGGG ACGTCAAACT 
GCTCGGAGTG ATTGGACTCG TTGATAGCGn TTn 
(2) INFORMATION FOR SEQ ID NO: 81: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19142 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81: 
CAGCACATGC ACATGCATAT TCTTCCTTTT CGTTCATGAG CGGTATCCGT TTCAGGTCAT 
TCAAATATGC CGTGAGAGTA TTCTCATCGT GGCGGAACTG CATTTGTCTC ACAGAGTACT 
CCTTTTGTTC GAAAGGTACG TACCATATAC TCAGCAAATA TCATGCCATT TACGATAAAC 
CAGAGGAGAT CTGTTCGGTG TCCCTATCAC GCCACTGGAC CCTCTGTGCA TACCTCCCTT 
TCCCACGGCG CGCTACAGTT CTCTTGATTC TTCAAAAGGA AATGTATAGA ATGCGCCCCG 
CGcGGTGCGT GTAGGCATAC GGCGCAAACG TGAAGATATG ACTCGTTATG AGGAGGAACG 
CATGAAAATT AT AC CGCTCG CGGACCGTGT CCTGGTAAAA ACTGATAAAT CGGAAACTAA 
GACTGCTTCT GGAATCATCA TCCCGGACAC TGCGCAGGAG AAGATGCAAA GCGGTACCGT 
CATTGCTGTT GGTTCTGACT CGGAAAAGAT AAAAGTTTCG GTGGGTCAGC GTGTCATGCA 
CGATAAATAT GCCGGAAACC CAGTAAAGAT TGATGGAGAG GAGCACCTGC TGCTCAAGGG 
TGCTGATATC CTAGCTGTCA TCGAGTAGTT TCATCTCTTT AACGGGtTGC GCGCGCTGGA 
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GCAGTGCACG CGCGGCCACC GTCTTTCCCT GTTGCTGCAA AAGTTGTCCT GCACGCTGGT 
AC CATTCTCT CCAGCGCTGT GCTTTCCCCG CCGCCGGGcG CkCTCGAGGA TGTTCTTCGG 
CCTCCCCCTC TCCCGCnTCC TTTTGAAGTT GTCGTACTGA TTCgCGCGTT GCTTCGTCTG 
CCGCAAGCAC AAAGTGTTCT GCAGCCGCGG TTTTCTTGCC CATCCGCTCA AGCACTATCG 
CTATGTTTGC GgArCGnCTG tGCGCGCTTC CACAGCGCGT GTCGCTTCCA CCAAGCCCCC 
GAGGATACTC TCCACCCTCC CCGTACGAGA AGCTGcAGCA AGTGCCCGAT AGAAAGCATC 
CTCAGGACCA CCTCCTGCAT CACTGAGACC AAACGCTGAC TCATATTCCC CAATTCGAAA 
AAAGAACCAA CGTGCATACC GACGCACtGT GTGTCCACAG GgAAGACTTC AAGCAACTGC 
CACAGCGCGC CCCGACTGCC CTGTAGTCTC TTTTGTGCAG GATAACAAAA ACGAAAGTAT 
TCCAATGCAA AACGCACATC TGCCCGCCCA TGcATCGCCT C TTGTGCT AA TTCCGCCAGC 
AATGACCGAG CCTCCACAGG AAAATGGGGA GACAAAAAGT GCTCCTGTTC CATCTGCACA 
G AAT AC ACCC CCCTTTCAGA GAGCAATTCC GTAACAGAAT CGCGCTGTCT GTGTGCTGCA 
CGAAAGGCAA TGCTCTCTCG CGCATAGCAC GCAACGGCAG GATAGTACGT TTTATCCCCC 
TCAAC ACAC T GCGCGAGCAT CCGAACACGC TCCTGTGCAT GAGGAGCAGT GAGTGCTAGA 
TTG TAG AG TG CATGTGTGGA TGTTCCAGGA AAACGGTCAA CGTACGCGTA CCAAAAAGCA 
CGTGCACGCG CTCTATCCCC TCCATCGAAA GCGGCATCTG CCGc TAACAG CAAGTGAGTA 
CGGGCGTGCG TCGAGGCGGC AGAGTACGTA CCGAACAAAT CTGCACGTGC AAGAC TTATG 
GGGAGCAACT CAAAGACCAA AGAAAACTGT CCTGCGTCAT AGGCGACTGT GGCCCAGAAT 
GCCGTATTCT CCGGTGCCTC AGTACCCAGT ATACGGGAGT ACAGACGGAA CGCAGCATGC 
AACTCCCCCA CGCGGGCGTA CGCACACGCG GCATTCTGGA GAAAGGCATG CATACCAGTG 
ACACCAAAGG CTGCAAGCaA AGCGCAGGAG CAATGCGGGC AAGTGCGATG CCAGTTGTAT 
GCCGTATTGG CGGCGACGGT TCTACACGAA ACTGCTCTTC TCCCCGGTCA AGAGGGAACG 
CCGTCACGTG CGCTTGATCT CTACCCCCCT GAACCTGAGA AAACTCAACA GGCTCCTCTG 
GCACGTGCTC GGCTCTCTCA CTGTGAAAAG AGCCCCGAGA TGGAGAGTCA CCTCCTGTGC 
GCGAACCT AT GAGAGCAGCC TTTATCAGTG CCTCTGCGCC AATACTCTCA TATCCCTTGT 
GCACCACCCG GGGGACATAG GAGATTGCTT CCTCGAAACG AGCCTCCCGC AGGAGAAGAT 
GTATCACCAA TGCAGCAAGC CTGCCATCAT CTGCACTTCT TCGTATGCCA CGCTGGAGTG 
TACGCAtGCC GCAGCGGAGG CTGAGAGTTC CATCTGACGC TTGGCAATAC TCAGATACTG 
CGACGACTCT CGCGCGTGTC CCACCATACG CGCAAGACCG CGCAAGGCTG CGCCTTTGCG 
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C t TCCTGAGC AATCAGACTA 
AAACGGAGCG ATAGATGCAT 
GGAGGTGGCG CTTACTACGC 
AAGGAGTCGT CcGTACCGAA 
GGGGGAATGT CTTTTTGAAA 
ACCTTGTTCA AACGCACAAA 
TCCCGCAGGT GCTCGAGCGT 
GAAGGAGGCG GGTTTCTCCG 
GCCGCGTCCC AGGCAAAGAG 
TTTGGGAATA CTTCACTCAC 
CCTTATAGGT ACCAGACTTC 
CTTTTTGTTG GATTTGGGAA 
TGTCAGGCTC CACCAAATCG 
TGAAGCACTG ATAGTCATTA 
CAAAAATTTC AAGCAACGCC 
TCCCCAGGTA TATTTCCCCG 
TGTCAGCCGT GGCTTTGACA 
CCTTTGCAGC AGCAGGACCT 
AGGGCGGAAA AAGGCCTGTT 
TCCCTTCAAA TTCCGAAACT 
CTGCGTCTTT AGAAACTTTT 
CTGATCGAGC GAGATATTCT 
CGCACGTCCA ATCTGTTGAG 
TACCTGGATA CCCGCCTTTT 
CATGAATTGT CTACGCTCAG 
AATTCTTGCC TTAATCTGTC 
GATAGGCTGC AGCGCCGACT 
GGGAACTATG TATCTGCCCA 
ACACGCCGGA GAAACTATAC 
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TCTGCACGCG CAAGTCTCTC 
AGCC CAGCAA TACCCACCCC 
ACTTTTAGCA ATGTTATCGA 
ATCCCGAGCG ATACTGACCG 
GAGTAGCGAA TACGCACTTA 
ATCCCAGTGT TCCAACCGTG 
ACCGAGAAAG AGGAGTCGGG 
CAACCAAGTA AACTGAGTTA 
AGCCTGAAAA GCCAGAATCC 
CAGTCGAGCA CCTTGCCGAG 
CTCAGcTCCG CCGACACTTC 
GTAAGAAGAT TCTTGATAAA 
CTGAGCGTTA GGATCTTCGC 
GCCGTTTCAT TCACGTCCGA 
TCCATCGTCA GACCAAGCTG 
GCAGAGTAAC CCGCCTGTGC 
CCGGAACTTT TAAGTTTCTT 
GAAACtTCGG GACAGAAATA 
TATTAAGCTC GTAGTAGGAA 
CGTCTGCCTT CTTTTGTGCC 
TGTATTCTGC CATAGTCATG 
GCTTCTCCTT GACGTAATTG 
AAAGCATTCC ATTAAAATAC 
CTGCCGCTTG AGCAAAAAGC 
CAGTGCTGAG CTTCTTTCCC 
CGAGCGTCAc CGGCTCACGC 
GCGCGTGCGC AAACCCCATA 
TGAGAACTCC CCGTAACAAC 
GCAGCAGGGC AAGTTACGAC 
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CAAAAAGGGC CGACTCCCCC 2460 

AACTACTCCA AGGTAAGAGA 2520 

GCACACCATT TACAAAGCGG 2580 

CCTCGTGTAT AACGACGACG 2640 

GGCGCAGGAT GGCCTTGTCC 2700 

AACTCACACA ACCGTCAATT 2760 

AAAACCCCAA ATCCTGGGTA 2820 

ACGTCTCCGG CGTGATGCCC 2880 

GAGCGCGCCT CCTCCCTATC 2940 

CTCTGCGTCA GCCAAGAATA 3000 

CCGGATAGCG TCCTCGAGCG 3060 

CTCGTAAAGC GAGACGGTCT 3120 

TTCCTCTTTC TTCAAAACTA 3180 

AACAGCACCA ACTCCCATGC 3240 

CGTGGCAGTA ACCGCAGTCT 3300 

ACCATTTGCC TTACTCTTAA 3360 

AACAAACTCT TGCGCTTTCG 3420 

AGAAACAACT TAACCGTGTC 3480 

CGGATTTGAG AGTCCTCAGG 3 540 

ACGTAGCGCT GCGTGCTCAC 3600 

CCGTTTTGCT GCTTCATGAA 3660 

GCGAACTCAG CCTCCGTTAC 3720 

TGATTCACTT CAGCATCCGT 37 80 

TTTTCGTCAA T AAGAC TGTC 3840 

ATCTCTTTCT CAATCGCAGA 3900 

CGGAATAAAT TCACTTCGGC 3960 

CCCGCCACGC ACAACAGAGC 4020 

CGCGACTcAG TCACCAGCAC 4080 

TTTTCCCCGC TTTTCTCAAG 4140 
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AATCGCGCGA ATACGCCGCA CCCCCGCCGC ACTCGAC TGT TCCTTTTGAA TAGAAAACCG 
CCCGAGCTGC CCGGTACGTG CTACATGTGG ACCGCCACAC ACTTCCCGAG AAAAAGTTCC 
TATGGAGTAC ACCTTTACAG TTGATTCGTA TTTCTCACCA AACAACGCCA CGGCACCAGA 
ATTCATCGCA TCTTCGAGCG ACATCACTTC ACAGCACACC GGCAAGTCTG CCCGGATCTG 
CTCATTCACC AACTGTTCCA CCTGCACCTT CTCCTGCGCA CTCATCGGTC TTGGGTGAGA 
AAAGTCGAAA CGCAGGCGTT CTGCCGTAAT ATTTGAGCCT TTTTGCTGCA CGTGCGTACC 
AAGAACCACT CGCAATGCCT GGTGCAGCAG ATGCGTCGCC GTGTGGTACG CTGTCGTTTC 
CGCTGAATGA TCAGCCAACC CACCCTTAAA TACTCGCTGT GCACCGATCC GAGAGCACGC 
CTGGTGCGCC TGAAACGCGG TGTCAAACCC TGCACGGTCC ACC CGTAAAC CCGATTCACG 
CGC AAGTTC C TCGGTCAGCT CAAGGGGGAA TCCATACGTA TCGTATAGCC GAAAGGCAAC 
TGACCCAGGT ATTTCTCGCT CTGTCCCCTG TAAAAACTTG GGTATCATCC TCTCGTACTC 
TGCCTCACCC TTCCTGAGGG CGTCGAGGAA CTTACGTTCC TCGTTTGCAA GCTCCTGCGC 
AATACACGTA GCTTTCTCTT CCAGTTCCGG GTATACCGCA GCGTATTGCC CAATCACCAC 
GCGCGCGAGG GAGGACAGGA ACTCCCCATC GATACCGAGC TTCCTTCCGT GGCGGACTGA 
ACGGCGAATG ATTC TGCGCA GTACGTAGCC TGCACCCACG TTAGATGGGC GTACAGGGAC 
AGGATCGCCG AGGATAAAAG TGGCCGCACG GATATGATCG CATACAATCC GCATGGATAC 
GTCGTGCGCT CCCTGACACC CATACCTCTT CCCACATAAC TGACCTATCC GCTCCAGGAG 
CGGGGTAAAG ATCTCCGTAT CATACACTGA CCGCTTGCCC TGCAAAACCG CGACGGTGCG 
TTCAATACCC ATACCGGTGT CCACACAATA ACGTTCAAGC GGCCGGTACC TGCCGTCTGC 
GTCCTTACGA TACTGCATGA ACACGTCATT CCAAATCTCT ACGTACTTGC CGCAAGAACA 
TCCCGGACGA CAGCTCACAC TGCAAGGAGG AACTCCAGTA TCAAAGAATA TCTCGGTATC 
CGGACCACAT GGCC CTGTTT CCCCCGTAGG TCCCCACCAG TTATCCGCAC GTGGTAAAAA 
ATGAATATGG GTGCGCGCGA TACCAAGTCG TTCCCAGATA GCGGCAGATT CCTCATCACG 
CGCAACAGCC TCATCCCCTG CAAAAACAGT CACCGAAAGC CGGTCAGGGG ATATGCCGAG 
CCATGGAGCA CCAGTAAGAA ATTCAAAGCT GAACGCGATT GCCTCCTCCT TGAAGTAATC 
GCCCAACGAC CAGTTACCCA ACATCTCGAA AAAGGTCAGA TGCGAGTTAT CGCCCACCGC 
ATCGATGTCA CCGGTGCGCA GACACTTTTG CGCATTGACC AAGCGGGTAC CAGCCGGATG 
TGGCTCACCC ATAAGATAGG GAACCAACGG ATGCATGCCA GCAGTAGTAA AAAGCACGGT 
AGGATCGTGC TCGGGCACAA GGGACTTACC CGAGATAACC ACATGAGCCT TCTGGCTAAA 
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GAAGGCGAGA TAACGCGAGC GTAgCTGATC GGCGCGAATA GGAATGCTCA TGGAGGGTAT 
TATCGCCTTT TCCCTGCTGC GGTCAACATC TGACCCTAAA CGGGAAAAAG AAACGGGGAC 
TCTCTGAGCA ACCTTGCGAC AAGATCCTTG ACAGAATTCG CACACACCTC TAGCCTCTCG 
CAAGACAGTT TCGCACCCTT AAAAAATATA AGGAGCACAC ACATGACCAC GTCTATTGTG 
ATCGGTGTTG TCCTTGTTAC TGTCGGTTTA AcCTTCGGAT GGACCATTCG CTGGCTCTAC 
GCCAGATTTC ACTTATCCGC CTGTGAGCAA CGTGCAGAAC GTATCCTCCA GGAGGCACAA 
AAAGAAGCTG AATCCAAAAA GAAAAGCATT CTCCTTGAAG CAAAAGAATA TGTCCTTCGC 
GAAAGAAATC AGCAGGAACG AGACGACAGA GACCGAAGAG CTGAGCTGCA GCGTGCAGAG 
CGACGCCTTC TTCAAAAAGA GGAAGCCCTC TCTACGCGCG CGGGGGAGCT TGATTCTCGA 
GAACGATCGC TAAAACAGCG GGATCAGTCC CTCTGTCAAG AAGAGGCCCG CTATCGCCAG 
GAGCTCGAGC GTGTCTCTGG CCTCACTCAG AATCAGGCAC GGGATCTCAT CATCAAAAAC 
CTTGAGAACG AGGCGAAgCA CGACGCACAG GCTCTCATCA ACAAGATAGA GGAGGACGCG 
GCTTTGAACG CTGAGCGTCG CGCGCGCGAC ATCCTCGTTA CTACCATGCA GCGTATTACT 
GCTGATGTCA CCGG TGATGT GACCGTCTCT ACGGTGAATC TACCCAGTGA AGAAATGAAA 
GGACGCATCA TTGGGCGCGA GGGACGTAAT ATCCGCGCGT TAGAGACACT CACTGGTGCT 
GACGTTGTCG TAGATGACAC ACCTGAAGCT GTCGTCATTT CCTGTTTCGA CCCGGTACGC 
AAAGAGATTG CGCGCATCTC TCTTGAGCGT CTTGTAC TTG ACGGTCGAAT CCATCCGGCG 
CGCATTGAGG AAATTGTGCA GAAGGTG AC G CAGGAAGTTT CTCAAAAAAT CTATGAGGAA 
GGGGAGAAAG TGCTGTTTGA CCTCGGTATT CACGATATGT GTCCCGAGGG GGTACGGGCA 
CTGGGGCGCC TGTATTTCCG TACAAGCTAC GGACAGAATG TACTCTACCA CTCAAAGGAG 
GTGGCTCTGC TCGCTTCCAT GCTCGCCTCG GAAATCGGCG CAGATGTTGC CATTGCCAAA 
AGGGGCGCGT TGCTGCACGA TATTGGCAAG GGAGTGGAAA CTGATTCAGA CCGCAACCAC 
GCAGAAATTG G T ATGGAG AT GGCTCGCAAA ATGAATGAGG ACCCGCGAGT GGTAAACGCC 
GTTGGTTCTC AC CACAACGA CATAGAGCCG TGTTGTGTTG AGTCTTGGCT CGTTCAGGTA 
GCTGATGCTA TCTCTGctGC GCGTCCTGGT GCTCGGCGTG AAATGGTGGA TCacTACGTC 
AAGCGTCTAG AAAACCTCGA GGCGATTGCT GAGGGGTTCT CGGGTGTAGA GAAAGCCTAC 
GCTATTCAGG CCGGGCGCGA GTTGCGTGTT TTAGTGAACA ACGATAAAAT CCCCGACAGG 
GACGTGAAGG CACTTGGACG TGACATCGCA AAGAAAATAG AGAGCGACTT GAAGTATCCT 
GGGCGTATCC GGGTCACTCT TATTCGAGAA ACGCGCGTCG TGGAGTATGC CCGCTGAGCC 
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TCAGGGAGAG GGGAGAGAGT GCACGGGCGT CCGTGCAGGT TTGCATCGGC TGCAGTGACT 7 680 

CTCCTACCTC CCTATTCTAG TCCGGCGATA TTGGTCAACA AGGCACATGG GAGTATCATG 7740 

GCACAACAGC GTATTACGTC TGATATCTTT GCTCAGCTGC TCACCCTTTC TCACCTCGAA 7 800 

AGCAGCGAGT GTGCAGTAGG ACTTGCAACA CAGATCGAGG ACATTATCCA GTATTTTTCC 7 860 

GTTGTAGAAC AGTTCGACCC CGGTCCACGC GACGATCCTG ACACGGATAA CGCACAAGGC 7920 

CGTTGCTCCC AGGGGAATAA AATTGACGTG GACTGCTGCC CGGACTGGGT ACGCAAGGAT 7980 

GTCGCATTAC CTGGTCTTTC CGTTCACGAT CTCAAGCGGT TGTCCACAGA GTTTGCTGAC 8040 

GGTTAcTtTy kCGCAcCGCG CGCGCTCGAT GGTAGCGCAT AAATGGACGC GCATGCTATT 8100 

ACCTGTGCAA GCTGGAATAT GTTAAAGGCT CAGCTTGAAG CCGGTGCAAT CAGCTCTTTG 8160 

CAGATTGTGC GTGCGTTTCG CAACGTATAC GAGGAAGACA CACGCAGCGC GTCCCCGCTT 8220 

GGGGCTTTGG TCGAGTTTTT CTCTGATGCG GAGGAGCACG CGCGTACGGC AGACAATCTC 8280 

CGTGCCTCGT GTGCCCAGAG TACTAAAACA GCTGGAGCAA ACGGGGGGAG TGTCTCAGGT 8340 

AAGCCTTTGT TAGGTCTACC CTTTGCTGTC AAGGACAATA TTTCAGTGAA AGGAAAGCAC 8400 

TGCACGTGTG GCAGTAAACT CCTTGCAGAC TATAGGGCTC CGTACGATGC CACCGTTGnT 8460 

TGCcGnCTGC GCGcCGcAGG TGCTaTCCCG CTCGGGAGAA CGAACATGGA TGAGTTTGCT 8520 

ATGGGCTCTT CCACCGAGTA TTCTGTTTAT GGGCCGACGC GTAATCCtCG GGATCGGAGC 8580 

CGCACCAGCG GGGAAGTTCC GGCGGTTCGG CTGCCGCCGT nCGCAGGCGG TnCAGGCACC 8640 

GTTTGCACTC GGTACCGAAA CGGGAGGCTC GGTACGC CTG cCAGCTGsTT aCTGCGGCCT 8700 

cTATGGCtGA AgCCGACcTA TGGTCTCTTG AGTCGATATG GGGTGGTTGC CTTTGGCTCC 8760 

TCTCTAGACC AAATCGGCTT TTTTGCTACC TGCATTGACG ATATTGCCCT CGCCCTCTCC 8820 

GTCACCTCAG GGAAAGACCT GTACGACAGC ACGAGCACTT GCCCCCCTCC TGCGACGGGG 8880 

CGACACGCTG TGTCTCACCA TCTTGCCCCT TTTTCTGCCC ACGAGTGCTC TATCCTGCGT 8940 

GCTGCTGTTC CCCGCGAATT AGTAGATGCT CCTGGCGTGC ATCCTGACGT GTCTGCGCAA 9000 

TTTCAACGCT TCCTCACCTG GCTGCGTGCC CAAAACGTAC AGGTAGAAGA AGTGACGCTT 9060 

CCTGCACTAC AGGCGGCAGT GCCTGTATAT TATCTTGTCG CGACAsctGA AGCCGCCAGC 9120 

AATCTTGCGC GTTTTGACGG TATTCGCTAC GGGCAGAGGG GAGACACTGA TGCTCTTTTG 9180 

GAAAATTACT ACCGCGCCGT CCGTACCTCA GGCTTTGGAC CCGAAGTACA GCGAAGGATC 9240 

ATTGTGGGGA ATTATGTTCT TTCACGCCAT TTCTCCGGTG ATTATTACCG AACGAGTGTG 9300 

CGCGTACGTT CGCGTATAGA ACAAGAATGT ACGCAGCTCC TCTGTTCCTA CCACTTTATT 9360 
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GTTTGTCCTA CTGCCGCTAC CGGTGCCTTC CCGCTTGGAG AACGCATACA TGACCCGCTG 
GCCATGTATT GCTCGGATTT ATTCACCACC TTCGTTAACC TTGCCCGCCT ACCGGCGCTA 
TCAGTACCAG TGGGAACATC AGGCACTGGC CTACCCATCG GAATACAGAT TATCGGTTCT 
CAGTGGCAGG AGTGTGCCGT TCTCCGGCTA GCAAAACGTT GGGAGGAGGC ACCTCATGTC 
TGACCTCCAA ACAGGCACAG TTCCCTCCAT TGCAGGCGCC ACAGATGACA CACATGCCGC 
ACCCTTTTTC TACGAGGTAA TTATTGGCTG TGAAATTCAT TGTCAGCTTC TAACAAAGAC 
CAAAGCTTTC TGTGCtGTGC AAATCGCTCA GGAGGAATGC CGAATAGCCG TGTGTGTCCT 
GTGTGTCTTG GGTTGCCAGG AGCGTTGCCC GTTGTGAGTG AAGAGTACGT GCGGCTCGGG 
GTGCGCGCCG GACTTGCGTT GGGGTGCACT ATCCAGCTTT GGTCCGCTTT TGATCGCAAG 
CACTATTTTT ATCCAGATCT CCCAAAGGGT TATCAAATTA CCCAGTACGA CGCTCCCTTG 
TGTACGGATG GTGCAGTGGA TGTACAGGGA GTTGACATGC CCGTGCAGCG cGTGTCCGTA 
TTGAACGGAT ACATTTGGAG GAGGACGCAG GCAAAAGCCT GCATGCTGCA GACGCTTACA 
GCTATATTGA TTTCAATCGT TGTGGGGTGC CGCTCATTGA AATTGTATCT AGGCCGGATC 
TGCGCTCTGC AGAGGAGGCC GCATGTTTTA TGCAGACGAT CCGCGAGATT CTCACCTTTA 
TCGAGGTAAC GGATGGTAAT TTAGAAGAAG GCGCACTGCG ATGCGACGCG AATGTTAATG 
TGAGGATTCT GTACAAAGGG CAAGAACACC ACACTCCCAT TTCTGAAATC AAAAATATGA 
ACTCGTATCG TATGGTGCGG GACGCGTGTA CGTATGAGGT ACAGCGTCAA TTGCAGGAGT 
TTTGGCAAAA GGGTCCTGCG AGCAAAGAAG AGATGCAGAG AAAACGCACG ATGGGCTGGG 
ATCCGGTCGA AGGGGTTACG CTTTTACAGC GTACAAAGCA CTCACTGCGC GATTATCGTT 
TCATGCGCGA TC C AG AC TT A CCTGACCTGC ACTTGACCCC TGCATATGTC CAGCATCTCT 
CTTACACAGT CGGGGAACTT CCGGCAGCGC GGCGTGCACG TTTCAAACTT GACCTTGGCT 
TGTCGGCGTT TGCAGCCCAA ACGCTTACCG GCAGCCGCAT GCTCGCAGAC TGGTTTGAGA 
AGGCAGCGCA TGCGTCTAAG AATGCGCGAC GAGTGGCAAA CTGGATTCTG TCGGAGGTTC 
TTGCGGTAGT AAACGAGAAG AATATCTGCA TTGCAGAGCT CAATCTGAGT CCTGAAGCAA 
TTGCCGAACT AATGGATGCA GTTGAAGATC AGCGCATTAC CGGAAAACAA GCAAAGGATA 
TATTTGCACA AATGCTTGCC ACCGGTGCGC GAGCGCAGGA CATTATCTCC GCACAGGGTC 
TGGCACAACT TTCAGATGAG GAAGAAATCG CAACGTTAGT GCAGACGGTG TTTCAAGAAC 
ATCCAAAGGc nCTGCGTGAT TGGCAACACG GTAAGACAAA CGTGGCTGCC TGGCTCATGG 
GGCAAGTAAT GAAGCGTTCC CGCGGGCGCG CACACCCTGC GCGAGTGGCG ACGCTCGTCC 
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ACCAAGCACT CTCTCAGCTG TAACAGCTGG AAAAACTCCA CGGAAGAGCG 
TTTCAGCATA CGCCCGGyCC CGcTCACGCC AGGAGAAGAA AGACGCACCA 
ACGCCTTCGC CTTGCTAAAT ATCTCCGCAA TTGTCTGTGC GAGCGTACCC 
GGGTATAAGG AGGCGGTGTG TCGTGTTCAA CCCCTATAGG TACATATATC 
CCAGACCATA CGCCGTCTTT AGCCGCGTCT TTAGCCGACG CACAGGTCGA 
ATAAACTTAC CTCACCGATA AACGCGGCAT TTGTTTTCAC TGGGGTGTTT 
AATACAAGGC CATTGCCAAC GCCACATCCA CCGCAGGCTC ATATAACCGG 
CCACATTCAC GTAGATATCC TGATCTGAGA ATTTCAAACC CACACGTTTC 
CTGCAACACG ACTGACGCGG GCCGAGTCGA TACGATCAGA AAAAACGCGC 
TTTTTGCAGG AACGGTCAAT GCCTGTATTT CTACCATAAA AACACGGCTC 
CGGGCACAGT TGCAGACCCA ACAGGAAACA TTCCCTGCCT GGTACTAATA 
CAGTGTCCTG C AC AGCGGAA AGTCC ATTTT CACCCATGGT AAAAATACCC 
CAGAACCAAA TCGATTTTTC AATGCACGTA AAAAACGAAT ATCCTCTTCA 
AAGAAATCAC AGTGTCCACC ATATGTTCCA CTACTTTTGG CCCGGCAATA 
TCGTTACATG CGCAGTAAAA AAGAGAACAG AGTCCCGTTC CTTTACCCAC 
CATTTGCGCA ATATTTCAGC TGATTGATAG TCATAGGAAT GGCACCTGCT 
AAACTGTCTG AATCGAATCA ACAATAACGA AGGTAGGGCA TCGTGTATTT 
CGACATCCTC GACCCGCGTC GCACAAAGCA ACTCGATGTT CTGAATTGGA 
GATCCGCACG CCCACGAATT TGCCCCGGAG ATTCTTCACC CGAAACATAG 
TCCCGCAGgC TGCAGCGATT TGTAACAGTA ATGTAGATTT ACCAATGCCC 
CAATCATGAT CGCGGAGyGT CTTACGGCGC CTCCGCCGAG GACACGATCG 
TACCACAACT AATACgCTGc gCATCCTGCG CGCGCACAGC ACACAGCGGG 
CAGGAGAAGA AGATGCCTTT TTTACAGCAC GAACATCGCC GGAGGACAAC 
CTTCGAAGGA ATTCCACTCC CCGCACTCAG GGCAACGCCC AAGCCACTTA 
AACCACACCC CACGCAGGAA AAGGCACGTT CCGTCTTTTT AGCCACTCCA 
GTCAGAAAGA AAAAGCACTG CACGGCGGGC CGctACGCAC CCTCCGTTTC 
CAGCGACGAA -CGAGCGAAGG AGAAAATTGC CTCCGCTGCA ATGCACGCTG 
GGCCCATATC CCCGCcTTCG CAGCTTCTCC AGCAACCTCA AGCAGAGCGT 
TGCTCTTGAA ACAAGAGATC TAACGCGCCT TCTGCATCTG CATGAGAAAC 
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TTTAACTCAC CTAATAGCTG AGCACGGGAC GCAGGACGGG AATCGACGCG GTTTCGCAAC 12900 

CACGCCCGCG CGAATCGcGT ATCATCAAGC CAAGAATATC TTTTAAGAAC CGGGAACACG 12960 

CTCTCAACGA CCCTCTTCTC AAAGCCTCTT TTCAAAAGCT TAAAACCCAG CTGCTGAGCA 13 020 

CTCGTCTCAC TCCGAGCGAG CAACCGCACT GCGACGCACT CAGCCTCATA ACACCTACAT 13080 

GCAAAGCACA CCGCCCCGTa CTGCTCATCA GTGGGACGAG TACCCACCAA CTCCTCTATC 13140 

GGGCAGGAAA GCGCGCCAAG GTAACTCAAG CGAGTCTGCA GAACAGCACC CACCTCATCC 13200 

GTAAGTTTAA GCACATCCTC CTGGAGGCTC TGGATAGCAC AGAGACAAAA GCGCCTATCG 13260 

CTTACTGAAT TGAAATCTAC GCCGTGCACC GCGCTGGCCA TACTTCTTGC GCTCTACCAT 13320 

GCGAGAATCA CGCGTGAGCA ACCCACCTGC ACGCAGCGAA GCCTGATTTG AAGCGTCAGC 13380 

ACGCACGAGA GCGCGCGCAA TACCATGcGC ACACGCACCA GCCTGCCCGT CAAGTCCGCC 13440 

GCCATACACA TTGACAATCA CATCGTAACG CCGCTCGTTC GCGGTAgCGA AGaGGGGTTC 13500 

GCGCACCCGA CGCAATTGct CCGCCGTAGG AAAATACGCG CCGACATCCC GTCTGTTGAC 13560 

GGTAACATTC CCGTTCCCCA TACGGATACA CACGCGAGCG ACAGCCGTTT TCCTTCTCCC 13620 

TGTTCCGATC CCAAGATTCT TCACGCTCTT TACCACTCCT TAACACGACA GCGGCACGGG 13680 

ATTCTGCGAC TCGTGCGGAT GCACAGATCC CGCGTATATC TTCACATTCT TGATAAGCTT 13740 

GCGTCCCAAA GGCCCCTTGG GTAGCATACC CTTAACCGCG TGACGCAACG GTTCGACAGG 13800 

CCGGCGCTTG ACCAACGCGC TAAACGACAC GCTCTTGAGC CCCCCAGGAT AGCCTGAGTG 13860 

GCGGTAGTAC ATCTTGTCCT TCGGTTTAGT CCCACTCAGG AACACCTTCT CAGCGTTGAT 13920 

TACCACAACG TAATCACCCA TTTCCTGGTT TGGCGTGTAC GATGCTTTAT GCTTACCACG 13980 

CAGGAGACAC GCAACGCGCG cGGCAACACG CCCCAAGGGG CGCCCCGCTG CGTCAATCAG 14040 

GTGCCAAGCG CGAACAGCCT CGCGCTCATT AACAAAGATT GTCCTCACCC CTCTGCTCCT 14100 

TTATCCACAC GCTCCGGTAG CTTGCGCGTG GGCGCACGAT CACGCATGTT TGCCACCCCT 14160 

ACAAGGAGAG ATATAAAACC AGCAAAAATA CCGCCGACTG GCGCAGCACC ACGCAAGAAC 14220 

CGCAGTGCAT CCTGCGCCCA CCCGAGACCA CAACCGTACT CCATAGGGGA AAGAAGAGCG 14280 

AAAACCGCCG CTCCCATCAG CAGCACACCA ACCAAAACCG CGCCCACCAC CGCctCCCGC 14340 

GCAAAATCTC CCGATCGTCT CATAAAAATC ATTCTGTGTC AAGnTGCGCT AGcACAAAGA 14400 

CAGTTAGAAT GCGTATACTC TCCAGATAGT TGCGGAAAAC TTCCTGAAGA AACCTTCTGC 14460 

TGAAAGAAAT cAGCTAACTT GAACAAACGC TCAGTCCGCC TGATGATGGG CGCATAACGC 14520 

ATGCAAAGAT TTGAAGACAT TCCGTACACG CGGCCACACA TGGACAT AC T CGAGCATGCC 14580 
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GTCGACGCGG CTCATGGGGA GTTCGCGCAG GCCCGTTGTG CACGCGATGC GTATGCGTCT 
ATACTCGCCA TAGAGGATCT CCAGCGCCAA TATCTTACCG CACAGGCACT AGCGAACATG 
CGCTGTTCCA TTGATACACG CAACACCTTT TACCGCAGAG AACAGGATTT CTTCGATGCT 
GTACATCCTC GCTTTGCCCG CTTAGATCAT GCCTTCAACC AGCTGcTGCT TGCATCACCA 
CAGCGCGACG GTCTTGAAAA ACTTATTGGC ACTCACCGCT TTACCCTTGC ACGCCTTCAG 
AGCAAAACCT TCTGcTCGGA GATTATGGAA GACCTCGCAG AAGAAAATCG TCTCACCAGC 
GCCTATGAAA CACTCCTCGC TTCCGCACAC ATTCTCTTCC GAGGCCACCA CTACACTCTC 
GCCCAGCTGT CCCCCTTTAT GGAACACACC GACCGCAACA CGCGGcGCGA CGCGCATGaG 
GCATACTATC AC TTCTTTGC TCAACATGAA TCGGAGCTCG ATACCCTCTA TGACACGCTG 
GTACGAGTGC GCACACGCAT CGCACGCACG CTCGGCTATG ACAATTACAT CCAACTCGGc 
TATGACCGCC TGTTACGCAG CGACTACGAT ATGCAAGATA TTGCGcgTTA CCGCACCTAC 
ATCCTGCGct ACGCCGTACC CCTCGCTGCG GAACTACATG AACAACAGCG ATCTCGACTT 
GGACTCAGTG AACTTCTCTT TTATGACGAG GCGTTGTACT TCCCTTCTGG AAATCCAGTT 
CCCCAGGGAG ATGCACCCTG GATATTGAAT CAGGCCGCTT GTATGTACCG CGAACTGTCC 
C C AGAAAC AG ACCAGTTCTT TACCTTTATG CGCGAGTACC ACCTATTTGA TGTCTGTGCA 
CGTATTGCAA AAGCGAGCGG TGGATACTGC ACAACCTTGA GCACATATCG TGCGCCTTTT 
ATTTTTGCAA ACTTTAATCG CACTGCACAT GACGTGGAGG TTATGACGCA CGAGGTGGGC 
CACGCCTTCC AAGCCTACCA ACGCTATCGA GCGCGTCTTG ATCCCTGTTT GGAAGCGTAT 
GTGTGGCCCA CGTACGAAGC GTGCGAGATC CCCTCAATGA GTATGGAATT TCTCACCTGG 
CCGTGGATGG GGCTCTTTTT TGGTGAACAG AAAGAACGCT TCTACCTGCG CCATTTAACA 
CAGGCAGTGG AGCTTTTACC GTACGGGGCA GCTGTGGACG AATTCCAACA CTGGGTGTAC 
GCACATGCGG ACGCTTCTGC CACTGAACGC AAGAAGGCGT GGCGCGCATT AGAAACTCAG 
TATTTACCTC GCCGTCGGTA CGGAGGGCAG CAC TACTTGT CCTGCGGGGG ACTGTGGATG 
CGTCAAAGTC ACATTTTCTG TATACCCTTT TACTACATAG ACTACACGyT CGCGCAGATA 
TGTGOGTTGC AATTTTGGGA TCGCAGCCGC GTTGCATACA CTCACCTTTC TACTCTCACC 
GGCGCGGCAC CGTACGCCAG CATAACTCCT ACTGCCTATG CGGAAGCCTG GCATGACTAT 
TGCGTACTGT GCAGCCGAGG CGGCAGCGAA CCGTTCATGC GTCTGCTTGC CACAGCGAAT 
CTGCACAACC CCTTTGAGGA AGACACGTTT GTTTCAACAC TCGCTTCCTG CCGTGCGTAT 
TTTCGCACGg TTGGTGACCG CCTTTCCTAG GTCTATGAAA AAAGGGTAAA AGATGCCTCG 
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CCAAAAAGAG AACTACCTGT CACCGTCCCC CGTGGTCGGG ATTCTTCGTG ACTGGGGAGT 
GCTGTGCACC TTACGCTTAA AGGGGAAACA CATGAAGCTT GTCTACAGTA CGGATTGCGA 
ATACCACATT GGACTGAAAG CGTCAGACAT CGGACACTAT GTTATCTTAC CGGGGGATCC 
TGCACGAAGC GAAAAGATTG CCCAACATTT TTCTCATCCT CACAAAGTTG GCCACAACCG 
CGAGTACGTC ACGTACACGG GCACCCTCTG CGAAACACCA GTCAGCGTCA TGTCCACCGG 
TATTGGGGGG CCGTCAACTG CAATTGGTGT TGAGGAGCTC ATCCATTTGG GCGCACACAC 
CTTTATCCGC GTAGGGACCT CAGGGGGCAT GCAGCCTGAT ATTCTTGCCG GGACGgTAGT 
TATTGCAACC GGTGCGATTC GCTTTGAAGG CACCAGTAAA GAATATGCCC CCGTGGAGTT 
TCCTGCGGTG CCGGACTTTA CGGTCACTGC TGCACTCAAA CACGCTGCAG AAGACGTGCA 
GGTGCGCCAC GCGCTcGGTG TGGTTCAGTG TAAAGACAAC TTCTACGGTC AACACTCCCC 
CCATACCATG CCCGTCCATG CAGAACTCAC GCAAAAATGG cACGCATGGA TTGCATGCAA 
CACACTCGCA TCCGAAATGG AGTCTGCAGC GCTCTTTGTG CTCGGGAGCG TACGGCGCGT 
GCGCACCGGC GCAGTGcTCT TAGTCATTGG AAACCAAACC CGCAGAGCAC AGGGATTGGA 
AGACATTCAA GTTCACGACA CCGAAAACGC CATACGGGTT GCAGTCGAAG CGGTCAAATT 
ACTCATCACC CAAGACTCCC CGCGCTAGGC GCACTGCAGT GCTTTAGGCA AAGTGTCGCG 
GATCGGTTAT TTCTAACAGC GCGGAGTCAA GCTCGGTCAA AAAGCGGCTG GGCTTCATCG 
CCGTGTGTCG TCCCCACATT CTGCGGTACG CACACGCGGT AAGGTATAAG CTATCCATAG 
CCCGCGTGCA GGCAACGTAC ATCAAACGCC GCTCTTCCTG TATATCTGCT TCGTCATCAC 
GCGGAAACAC CCCGTTCTCT AGTCCGGTCA GAATCACCCG TCGAAACTCC AGCCCTTTTG 
TATTGTGAAT GGTGATCAAG TGCAtGCGTC AGCCGcTCCT CCCTCGTCGG CCATATTTTG 
GTCCAATTGG ATGTGTTCTA GGAAACTCAC TAACCCCTCA TGCGAACATG CATACAGTGA 
CGCTGCGTTC ATTAACTCCT GCACGTTGAC CGCGCAtGCG TCCCTTCTTC CTCATCCTTC 
TGTCGATACC ATTCTTCCAG CCCCGTGTGT TCCATTACCA CAGAAACAAA GCGCGCAAGT 
CCtTCTGCaT CGTGGGTGCG CTCCtCTACC GGCTCCTCAG GCGGCGCACT GGTGCGAGCT 
TCTTCTCCCG CTGCGGGGGC CTGTGGCATG CGTGCACGCA GCGCACGTAA CAGCGACAGA 
AAGCTACTGA CCTTTTGCCg CGCACGCGTG CCAAGCGCGG TCAGGTGGGT GGACTGGAGT 
GTGGTAAAAT CAGTTATGGC TGCCTGCTGT GCACAGACAA ACAATGCGTC TTGTGTCTTT 
TCTCCAATGC CCCGAGGCGG CTTATTCAGC ACCCGCCGGA GGGCCAGTTC ATCTGAGCCA 
TTGACTATGA GCTGGAGAAA CGCCAGCACG TCTTTTACCT CTGCGCGACT GTAGAATTTG 
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AGCGTGCCGA CAATGCGATA CGGAATGCGA TTCCGCAAAA AACACTGTTC AAAACTCAGC 18120 

GACTGTGCAT TTACCCGATA TAAAATCGCC CAATCCGCGT ATGGGATGCC gCGCGCACGC 18180 

kcTTCTTGAA TGAGGTGCAC GCACAGCGCA gCTTCTTCAT CTTGATTATT CAGCAAGAAC 18240 

AGGCGCGGCT TAGTACCTCC CGTGCGCTGG GCAATCAGCG CCTTTCCTAA GCGGTCTTGG 18300 

TTTTTTTTCA CTACCGAATC AGCAACACGC AGAATTGCGT CTGTGGACCG GTAGTTGTAC 183 60 

TCCAGGCGGA TAATCTGGGT ATTTTGAAAG AACTCAGGGA AGGTCAAGAT ATTTTTTACC 18420 

TCTGCTCCGC GAAAG cGAT A GATGGACTGA TCGTCGTCCC CTACCACACA AAGATAGGTG 18480 

TGCGcACCGG TGAGCACCTG CAAGAAATGA AACTGCGCCA CGTTTGAGTC TTGAT AC TC A 18540 

TCTACCATGA CCACCTGCCA CCGTGCATGC AGCTGTTCGG CGACGTCCTG GTgcTCACGC 18600 

AAGAGCTGCA CCGGAAGCAT AATCAGATCC CCAAAGTCTA CCGTTCCCAT TTCGCGCATA 18660 

CGCCGATGGT wrCACGCATA TGCGTGCGCA AACTGCCTGT CACCCAGAAC GGCTCTGGCA 18720 

GCGCATGCAG GAGCGGAGAC ACGCGCGTGC ACTGACTCAA ACGAGGCGCA GTCGAGCCCA 18780 

TAGTCCTTTG CCTGAGAAAT TCCACGCGCG AGCATGCCTG CCCGACTGTG ATCGCAATGA ' 18840 

GGCAGGATTT TTGGCAAGAG TGCACGGACG TCATGGTCGT CATAAATACT AAAATGGGGG 18900 

TTCAATCCCA GACGGACTGC ATAGCGACGC AGGATCCACA CCCCCAGTGC GTGGAAGGTA 189 60 

CAGATAGTTG CCCCTGCGCG GCAGACTCAA GCGCGCAGGC ACGCGTGCGC ATCTCAgCGC 19020 

CGCTTTATTG GTAAAGGTTA CTGCCAGAAT CTGCTCGGGG CGAACCTGCC GGGAACGGAT 19080 

AAGATGGGCG ATTTTGGTGG TGATAACGCG CGTCTTTCCT GAGCCTGCGC CGGCAAGGAT 19140 

AA 19142 

(2) INFORMATION FOR SEQ ID NO: 82: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 2178 base pairs 
<B) TYPE: nucleic acid 
<C) STRANDEDNESS : double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82: 

TATCTATTTC AGTTGGATTT TGGCGCCTGG AACTGGGGAT TTTTCGTGTG CCACTTTCTG 60 

ACACGCTAGC TCTGTGGCGC ACATTGAAGG AGCGCCTcTC AgGGTTGTCG CTTCCCACGC 120 

TTGCGgTGGA CTTGCCAGGG GGTGGAGGAA AGTTTCCGCT TGTGGCATTG GCCTTGCAGC 180 

AAGATGTCAC GTGGCATCAG GAACGCGAgG CGTTCTCCGC ACGCGGcATC GATGGCGCGT 240 
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GGTACACGTA cCCGTTCTGA CCGTCAAAAC ACCCCGCGCC TCATCGAGTG GAGAGTGCGC 300 

CGCGTGGTAC AsGGGGCAGC GCGAAACACC CCTGCGTGAA GGTAGGCGGA GGAGGGGGTA 3 60 

CGGTGGTATG CACCGAACCC AGTGATGCAC CGCCGCCCCC CCCCGAGGGG TCCTCTTCTG 420 

CCCCGCGCGC CCACGCAGCG CCGCAGAACG GTGAGAAAAG AGGACGCGCT CTCTACCGGC 480 

TAACACATAT CACGCGTGCA CGCAGCCCCC TCGCTGCCGT TGATGCGAGC TTCCAAAACC 540 

TCGAGTGCAT CGCGAAGCTG CTGTGGCAGC CGCGCGCCAA ACTTTGGGTA GTGATTCTCT 600 

CGGATGTCTT TAATTTCCTT TTTCCATCCG GCTATGTCCA CCGACAAAAG CTCTTTCACT 660 

GCCTGCGTGC TTACGTTTAA CCCCTCTGTG TTCAAGGCTC CCTCTTTGGG CATCCAACCG 720 

ATCGCTGTTT CCACCGCGTT GTCCACACCA TCACAGCGGT CAAAGATCCA CGCGAGTACT 780 

CGGcTGTTAT CGCCATATCC GGGCCACAGG aAGTTGCCCt CTGCATCTTT ACGAAACCAG 840 

TTAACGCAGA AAATCTTTGG CAGGTTTTCG GCACGTGCCT GCGATCCGAG CTTAATCCAG 900 

TGCGAAAAGT AGTCTGCCAT ATGGTAGCCG CAGAAGGGGA GCATCGCGAA CGGGTCTCGG 960 

CGAATCTGAC CTACCTGGTC AGAGATAACT GCTGCAGTTA CCTCCGAGCC GATGATGGAA 1020 

CCTAGAAACA CCCCGTGATT CCAGTCCCGG GCCTGATGCA CCAGGGGAAC CGTACTGGGG 1080 

CGACGGCCGC CAAACAGAAA AGCGTCGATA GGGACCCtTC GGGATCTTCC CAGTTACTTG 1140 

CAATTGCAGG GCACTGTCGC GCAGgAGCGG TAAAACGCGC ATTCGGATGC GCAATTTCTT 1200 

CTCCTTTTGG ACTTTTATCG CGTGTAGGTG CGGGGCGCGA CACGCCGTGC CAATCAATGA 1260 

TTGTTCCTTT AGCGGGATAG CCGATACCCT CCCACCACAC GTCGCCGTCT TCGGTCAGAC 1320 

CACAGTTGGT GAAAATGGCG TTTTCCTTGA TAGAGTCCAT GGCATTCTTG TTCGAGAAAT 1380 

CAGATGTCCC TGGTGCTACG CCGAAGAACC CCGCTTCAGG ATTGATAGCG TACAGGCGGC 1440 

CGTCCTTTCC GAATTTCATC CACGCGATGT CATCGCCTAC GGTCTCGACC TTCCATCCAG 1500 

GAAGGGTAGG GATCATCATA GCCAGATTCG TTTTGCCACA TGCAGAGGGA AACGCCGCAC 1560 

CAATGTACTT GGTCTTTCCA GCAGGGTTGG TGATTTTAAG GATGAGCATG TGCTCTGCAA 1620 

GCCACCCTTC GTCTCGTGCG AGTACTGAAG CGATGCGTAA TGCGAAACAC TTTTTCCCCA 1680 

ACAGGGCATT CCCTCCGTAT CCTGAACCGA AAGACCAAAC CAAGCGCTCT TCAGGAAAGT 1740 

GAGAGATGTA TTTGCGCTCC ATATCCGCGC AGGGCCACTG GCCTGCGTCA GTTACGCCCG 1800 

GTCCTAACGG CTTCCCCACA GAGTGCAAAC AGGGGACGAA CTCACCATCA GTACCCAACG 1860 

CCTCAAGCAC GCGGGTACCC ACGCGTGTCA TGATGTGCAT GTTGCAAACG ACGTACTCAG 1920 

AATCGGTGAT TTCGATGCCA TTTTTAGAGA TGGGTGAGCC GACCGGTCCC ATGGAAAAGG 1980 
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GAATGACGTA CATGGTACGG CCCTTCATGC ACTGGGAATA GAGACCGGTC ATAGTCTTTT 2040 

TTAATTCTGC aGGATCGGTC CAATGGTTAG TGGGTCCTGC ATCATCCTCC CTTTTTGAGG 2100 

CGATGAnAGT GTTCGCTTCG ACGCGCGCAA CGTCGGAGGG CTGTGAGCGA AAGAGGAAGC 2160 

AGTTCTTACG TTTTTTTA 2178 
(2) INFORMATION FOR SEQ ID NO: 83: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9365 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
<D) TOPOLOGY: linear 

(xi) . SEQUENCE DESCRIPTION: SEQ ID NO: 83: 

TAAATATCTT GTAAGCTACG AGGAGCGAGA ATGTCTGCAT TGTTTTCCTT GGTTGCGGTG 60 

TACGTGCTTG TGTGCGCGCT GCACAAACAG ATTAAAAAGT ACGCGTCTGT CTGCTACCTC 120 

GGCAGTGCGT GTGTCAGTGT TGCGGTTGTC TGTGTCGTGT GGAGCGGCGC AACCAAGGGG 180 

AATTTTGGCG TGCGTGTGCT TCTGCATCCG CTGACGAGTG CGAGTTTTTC CACCGCGATC 240 

TTTACATTTG TGATGTGCGC GAGTGTGCTG AAGAACGGTT TGCTCAAGCA GCGTGTCATG 300 

GGGTTGCGTG CGGAACTGGC CATCACCGCG GCGATTCTCA CTCTCGGGCA TAACATCGCG 360 

CATGGAAGGG ACTACCTGGT GCGTCTGTGC GGGAGTACCG GGGATTTGTC TACAGGGTTT 420 

CTTGTCGCGG GTGCTGTCAG CATGGTCCTC GTTCTTTTGA TGAGCATTTT GGCGGTAACG 480 

TCTTTCAAGg TAGTGCGCAG GCGCATGGGC GCAAAGACAT GGAAGCGTGT GCAACGCCTT 540 

GCATATCTCT TTtACGGGCT TACGTATGTG CACCTTTCCT TTATCCTCCT ACCGACCGCT 600 

TTGAGGGGGT ATATCCCGAG TGTCGTCTCC TACGTGTTGT ATACCGTCAT TTTTGCCACG 660 

TACGCGTTGC TGCGTGTCCG CAAGGCGTTG GGAAAGCGGA AGGGTGCTTG CGCGTTGTGC 720 

TCCGCGGCGG TTGCTGTGTC CTTCGTTGCG TTCGTCTTGG GCGCGTCTCA CATGGTCAGG 780 

CACACGAGGC GTGCGCACAC GGAGAGGACT ACGCGTGCAA AGGCGCGGAA GTGTTCTCCT 840 

GCAGAGATGA AGGACGGCGT CTATGAGGCT AGCGCGCAGG tCACAACGGA AAGCTAAGTT 900 

TGAGGGTGAC AATCTCGCAG GGTAGGATTG AAGCTGTCAC CGTCGTGGGG CACAGCGACG 960 

ATGATCCTTA TGCCTCCTGG GCGGTAgAGG GTGTCTCGGC GGCAATTGTA GGGGCTCAGT 1020 

CTACCGATGT CGATGTGGTG AGTGAGGCAA CTTCCACTAG CGAGGCAATA ATTCGGGCCG 1080 

TGGAGAAAAT TCTCCAGCAA CCGCAACCGT AGATCTAAGA AGAGCGGCGG TGGGATGTGC 1140 
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TTGTTTGTCT TCGTTTTGCT GGGCGCGTGC GGGGGTCGGT GTGTCTGTGA GTAAAGGTTT 
AAGGGGTACG CTCATTGGTG TGGGCGTCTC ATTGGCTTTC CTGTTGGTGG TATATCGCCT 
CTTCGCGGGG AGCTATAACC GTATGATCTC GCTTGATGAG GCTGTCAAAA GTGCCTGGAG 
TCAGGTAGAG GCGGTCCTGC AGAGGCGCTT AGATCTGATC CCCAACTTGG TCTCGACGGT 
CAAGGGTTAT GCAGAACATG AGTCGGATAC gCTGAAGCGT GTTGCGGAAn GCcGTTCGCg 
TGCCGGTGGC GTTATGCAGG TGGGTGAAAG CGCACTCTCT GATCCGGAAA AGTTTGCTCG 
GTTTCAGCGT GTCCAAGCAG AAATAGGCGG TGCACTCGGG CGTTTGTTGC TTGTCAGCGA 
GCAGTATCCT GCCTTGCGCG CAAACGAGAG TTTTCTTGCG TTGCAAAGTC AGCTTGAAGG 
AACTGAAAAT CGTATCTCTG TGGAGCGGAC GCGCTACAAT CGCGCCGTCC AGGAATACAA 
CGCCTATATC CGTTCTTTCC CACGCAGTGC GGTGGCGCGT TGGGGTAATT TCAGCTCCCG 
CCCGTACTTT ACGGCACATG AGGGGGCTGT TGcTGCGCCT CGCGTTCGGT TTTGAGTGTG 
TGCGTTGCCG CGCCAAGAGC GGTGTGGTTT TCTTTGCGTA GGATCGGTGA TCTTCTCTCA 
TGGCTCGTGG GGAGTAAGGG GGTTCGTGGG TCTTTTTGTG CCGATCCTGT CCCTCGTGTG 
GATGGCGCGC ACGGTGGCGA TGACCCGAGG CGGGAGGAGG ACGCGTCGGG CGCTTTTCAT 
ATTCCCCGTT TTGCTGCTCG GTTAGGACTG GACTCTGCAG GACTGCTGCG TATCCAACGT 
ACGGTGGCTC GTGTCGAAGG ACACACTGAC GGAGAGGTGG CGCTCGCTCT TATCAAGGAA 
AGTGATTCGT ACTCGGTCTA TGAGCTGTTT TTCGCACTCG TACTTGCTGG CGCCTGTTTT 
TGCGTTCAGC TGAGCATGGT GTCGGTGCTT GAACAAGCGT TTGCgCGTTT CTTTTGGGCG 
CCGCCATCGT GGTATATGCC gGCGTTTATG GTAGCGTCGA GTGTGGCAGC GTGTGTACTT 
TTTTTCTTTA TCGCAAACAT TCCGTGGGTG GATCGCGCGC TGGTACCGGG ACCTCTCAAG 
CAGCAAAAGA GCTATGCgCg cGCGGTGCGC CaCTTTGTGG AAAGTGGTGT GTGCAATACG 
CGCAATCGCA CGGGGATTCT CATCTTTATT TCTGTGTTGG AGCGGCGTGT GCTGGTGCTC 
GCAGACGTGG GGGTGAGCGC GTACGTGCCT GCCCgTGAAT GGACTGAACT GTGTCAAATA 
ATTACTGCGG GGTTGCGCTC GCGTCGTGCA GCGGACGCCT TATGTGAGGC ACTTACCCGT 
GTGGAACAGG TGCTCgctAC GCgGATGCCA CCTCAGaAAA AAAGTTCCAA CGAGTTACCG 
GATGGGTTAG TGATTTTGAG TCACTAGAAT TGAATCGGTG TTCTGCGTAt GTGATTCCAC 
TAGGTAAGCG GCATGGGCAT TCTGCTGTGG CCGTGTGTTC GCGGGGTGTT TCTGTGTTCA 
GAGGAGAGAA GGCGATGGAT GGGGTGCGGG GGCGCCTGAG TGCCCTGCGT TCGCTGATGC 
GTGCGCAAGG GGTAGATATT TGTTATATCT CCGGTGAGAA CGCGCATGGG CAGATAGATG 
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GTGCGCGTGA ATATTTTTCT GGATTTACAG GCTCTGCAGG AATAGTGGTG GTGAcCcACA 2940 

ACGTGyGTTT TgTGGACAGA TGGACGGTAT TTTATCCAGG CTGAGCGTGA gcTGTCGGCA 3000 

TGTGAAGTGT GCCTTTTTCG TACAGGGCAG GCGGGCGTAC CTCGGGTAAC TGAGCTTTTG 3060 

CGTACAGAAC TCCGTGCTTT TTCATCCGGG CCTGGGCACG GAGGGGGAAC GCTCGCAGTG 3120 

GATGGCCGTA CGATTTCTGC GGCTGTATGG GAGCAGTTTC AGCAGGAGCT TGTGGATGTA 3180 

TCGctTCGCC TAGACTTTGA TGGGgCsCTC CTGCTACCGC AGaGCATCGC t TTCCGTA t T 3240 

CCTTCCCtGC GTTTTTGTTG GATGAGCGCT ACACGGGgTT GAgTGCGGCG CAAAAGCTCA 3300 

CCCAACTGCG CGCAGCGCTC AgTGCACGCA GCTGTGATGC AACGGTGTTG TCCACATTGG 3360 

ATGATGTGTG CTGGCTCACC AATGTGCGCG CACACGATGT GCCGTGTACA CCGCTGTTGG 3420 

TGGCATACAT GGTCGTCACG CACACCCGTG CCTTTCTTTA TGTGGATATG CGCAAAATTT 3480 

CTTCTGCATT GCATCAAGCT TTGTATGCGC AGGCGTTGAG TGTATGCCGT ACGATACTTT 3540 

TTTTGATCAG GTGTGTGCGC GCTGTGGGAT CAGGAACCGA CGGTACATGC TGTGGGGAAA 3600 

GGAAGAGCAG GAGTGCAGGA AG TGGCAGGC AGAACGCCAG TGcGTGTTGT TGGACTTTGA 3660 

GCGTTCGTGT GCTGCACTGG TGGATCTCTT TCGTGCATCG CCGCAGtGTG TAGACCGCaG 3720 

t GGAAcGATC CGCTCCTTCC GGTTCGTTAT CTTCTGTGCT GGGCATAGAA GAGGATGGTA 3780 

CTGAGACCAG CAG&GGTGGG AAAAGTGCAT GTGCGTTGCA GTCTGCaCGC GACGCTCTGG 3840 

CAGCAGGGAA AGAAAAGGAG AACAAAGAGG AGCGGGGACA GACCATGTGC TTCTCCGTTT 3 900 

GTCGTGGACT TTTGCCAACT GTAGCGCTCA AAGCATTAAA GAACGACACC GAGCGAGCCA 3960 

ATGTGCACCA GGCAATGATA CAGGATGGGA TTGCGCTGGT AAAAACGCTG CAGTGGGTGT 4020 

ACCAGCAGCT TGa CGTGGGT GCAGACGTTG ATGAATGCGC TGTAGCGGAG TTTGTACGTG 4080 

CTGCCCGGGC GGTGTCTCCG TCTTTCATTG AAGAAAGCTT TCACACCATT GCAGGATACG 4140 

GGGCGAACGC AGCAATGGTG CATTACCGCC CCGTGCGTTT TTCAGCTTTA CACCCTGCTG 4200 

CGGGTCAAAc GGCAGCAcTg cTTCGCGCGC GTGGTTTTTT ATTATTGGAT TCTGGCGCGC 4260 

ATTATCGTGA GGGTACCACC GATGTGACGC GCACGCTGGC TCTCGGTCCT TTGACAGATG 4320 

TGCAGCGTGC AGACTACACG CTGGTATTGC AGGCGCACAG TGCgCTTGCC GTGCGcGCTT 4380 

TCCTGCAGGG AC CAGTGGGG CGGTGCTCGA CGGAATTGCC CGGGCTCCGC TGTGGGCACA 4440 

GGGACGAGAC TACCCACATG GGACGGGGCA TGGGGTGGGT TTTTGTCTTT CAGTGCATGA 4500 

GGGTCCCTAT AGTATTTCTC CGAGTGCTCC CGGGAGAGGA GGAACTGCAC GAGGCATTGG 4560 

GGCAGAGCAC ACGGGAGATC CTCCCTTTTT TTCTGAGGAG GCGGCGTGGC AGCTGCGCCC 4620 
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GGGTATGCTC CTTTCCAATG AGCCTGGGGT GTATGTGGCT GGCTCTCATG GCGTGCGCAT 
AGAAAATCTT ATGTGGGTGG TACAGGCGCA TGAGTCTGAC GCGCAgTGTG TGTGGAAGGA 
AGGAGGGGAG GGAAAGGAGG AGAACGCGGC GGCGCGTGAG TGTACGGGTG CAGATAGGAT 
GCAACCGTCA CGATGCCGAA GTTTCTATGG ATTTCAAACT GCAACGCTGT GTCCAATtGA 
CACGCGGCCG CTCGTGCGAG AACGATtGCA CGATGAAGAT ATTGCGTGGC TGAATGCCTA 
TCACTACGGG TGTATGTAAC CTCGCGCCGT TTTTAGAATc cGTACGCGCG CCTTTTTGCG 
CACGTGCTGT CGTGCGCTAT AGCGTTTTCC TGTTGATTGG TGTTATACGC AGATCTTGAT 
TTTTGATCAT AAAGGACATT AGCTTAGGCG GGTGCACTGC ATGTGGTGTC TGAAAAGATG 
TCCGCGGGCA AGAGGAGGGA ATTGTGATTA CGATTTTCGA AGCGCTTGAG CGTGTGCGCG 
TCATTCCGGT GGTGACGCTT GAGCGCGTGG AAGACGCAgT GCCGCTTGCA CGCGCCTTGA 
TAACAGGTGG TATCAGGTGT ATGGAAGTAA CATTTCGAAC GTTGGTTGCT GCGGAGGCGA 
TTGCGGCAAT CCGTCAGgAA TGTGCTGATG TGTTACTTGG TGCAGGAACC GTACTGACGG 
TAGAGCAGGC GCAgCAGGCG CAGGCAGCAG GTGCGCAgTT TGTGGTCAGT CCCGGTTTTA 
ATCCGCGGGT TGTTGCGCAC TGTTTGGGGC ATGGCGTTCC GATCATACCG GGGATAGCAT 
CTGCAACAGA AATTGAGCGT GCGCTTGAGT TTGGTATTTC gGTAGTAAAG TTTTTCCCCG 
CTGAGCTTTT GGGAGGTACG GCAATGATGA GTGCGCTCGC AgTCCCTACA CGGCGGTGCG 
TTTTGTGCCT ACGGGGGGAA TTCATCTTAA TAATCTTGCT GAGTATGTGG CGCATCCTCG 
GGTGCTCGCC TGTGGGGGCA GTTGGATGGT ACCGGCGCAG TCAATAGCGG CAGGAGATTT 
CTCGCAGgTT ACTGCACTTT CTCAGCAGAC GTTACAGATT GTCGGGGTAA TGTAGGGGGT 
GGTGGCTCAA ACGTTGTCTT TCTCAGAGAG GGCACGTATA CTGCCGGCGG CACTCAGGTG 
CGGGAGGTAA CTGGTGCGCA GAAAC TCG AG GACGATGGCA AAGATAGTGG CGcTGCTTGC 
GCTTATTCTT TTGCTCATAT TGGCAGGTTT TATTTGGTTT GATTATTTAG GAGTCCTCGA 
TGCAAAGCGG GCGATTTCTC CGTTGTACCG TCTTTTTGGA CGTTCGGTGC CGGAGGGAGT 
TGTG TCGACT GCTGATCCGG ATTTGGATGC GGATCGTTAT GCCAAGCGTC TTGAGGCGCT 
CGGGGAGCGT GCAGAGGAAT TAGATAAAAA GGACGCTGAG CTACAGGAAA AGGAAAAGGA 
TCACGAAAGG GTTTCTCAGG AGTTGGATGA GCGTCTGCGC GCGCTTGAGG ATAAGGAGAA 
ATCCTACAAC TTGCTTGTTG CGGAGACAAA CGAGCGTCGC GGGAATGTGC GTAAGATTGC 
AGAATACGTC AGTGGTATGC CTCCGGAGAG TGCGGTAAAG ATTCTGCTGA AAACTGATGA 
TCAAGATGTG ATTGAAGTGT TTCGTATGGT GGATGCGGCC GCTCGGCAAA GGGGTGTTAA 
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CTCTCTTGTG CCGTATTGGC TTTCTCTTAT GCCTCCTGAC AGGGCAGCTG AGATTCAGCG 
GAAGATGGCA AATAAACCTG CTGACTTTCC CTAGGCGGTT TATACCAACT TGATAAAGTG 
CTCTTATGTT TTTTTCTCTC CTTTCGCCGG CTGCTTTTCT GTTTATTTTT TTTATCCTGT 
ATTGGTACGT TTTCCGCACC GCGACGCAGC GGGTCGTTGT GCTGCTTGTA GCAAACTTTC 
TTGCTATTGC AGCTTTTGAT ATTCGCTTCT GCATTCCGTA TCTCGTTTTA AGTGCGTTAc 
CTACAGCTGT GGGTTGCTCA TACTCATGCA GAAAAGTTTC TTATGGAGAA AAGTCCTGCT 
CATTGCGGGT ACCTTGTTGC AGATACTTTT CTTTTGTCTT TTTAAACATT TCTCTGATAT 
GCTCTCGCTC GTGCGTGCAT TTGCTCCTGC ATATTTTGCG CAGCACACAT GGCACCAACA 
TGTAAAAGAC TGGAATATAT GGCACCCAGT GGGTATTTCG TACTGTACAT TCAAATGTAT 
GAGCTATGTG TTTGACGTGT ATCTGTGCAA GATACGCAGA AGAGAGCCgT TTGCACGTGT 
GCTTTTGTAT GTGTCTTTTT TTCCTCAAAT GATTTCAGGA CCTATTGCAA ACGCATCGCA 
TTTTTTTACA CGTCTGCCGC ACAATTTGCG CGCTGGTGAA AGCCCC TTAG ATCGTCCTAT 
CCACTTTGAT CGTGCGGTGG TATTACTGTA CACGGGGTTG GTCAAGAAAG TTATTTTTGC 
AGATTTTCTT TCTATACTTG TGACTGATAA AATTTTTACG CTTCCTTCCG CATACAGTAG 
CACCGAGTTG CTCTTTGGCC TCATCAGTTA CAGTGCGGTT TTATACTGCG ATTTTTCTGG 
GTACAGTGAC CTGGCAATTG CAGTTGGGTT GCTTTTTGGG TTTGAAACAC CGGCGAACTT 
CAAACGCCCT TACATATCTC AGTCAGTTAC TGAATTTTGG AGACGCTGGC ATATTTCCTT 
TTCTCAATGG TTGAAAGAGT ATTTGTATTT CTCACTTGGG GGTTCACGTT TTGGGATCAA 
AAGAACGGTG TGTGCACTTT TTTTTACCAT GCTGATCGCA GGTCTCTGGC ATGGCGTACG 
CTTGACGTTT CTGTTGTGGG GTATGGCGCA GGGTGTGGCT TTGGTAATTG AGCGGGTGTA 
TAGGGAAAAA AGACGGGTGA ACGGTGCGAA TGCCTTTGGA TCAAGTAGTG TGATGGGAAG 
ATGGAAAGCG CGTGCTATGC GGTGTATACG CGTCAGTGCA TTGTTTCTTT TTGTCaGTGT 
TGGATGGCTT ATTTTTCGCG CACCGTCTTT TGCAGAAGTG TGGCGGTACG TTACCTTGCT 
GTTCCGAGGA AGTTGGCATG GGCCATTCCA AGTTATCACG CCATTTACCG CGTTGCTCGC 
GCTGTGTGCA CTGTGTGTAC AACTCCCTTC AGATCGTACG CGTGCGCGCG CGTTTGCTTG 
CTACTGCGCA GTGCCCTTAC CCGTAAAGGC TTTGTGTGCG GCGTTCTTTT TCTTTGTACT 
GTCGGTTATG AC TC CATC AG GTATTGCGCC CTTTATTTAT TATAGTTTTT AGCGAAGGGG 
CTATGATGAC AACAGTGCGT GTTATATTGC AAAGGTGTGC ACGGGGAATA TGTAGTAATA 
AGGGTCGGTA CAGTGCGAAT CAGGTACTGC TTTTTTGCAT ACTTACGCTG AGCCTGTGGA 
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CGCCATTTCT CGGTCCGGCG TTGCGCCATC CTGCCGTGTA CATACGGCAC AAGAGCGTGA 8160 

AAAACATCTA CCTTGCATTC GTAGATCCGC TCATGCACAC TGCAGAACAG TGGGGAATAG 8220 

ATACGGTATT TCCTCTTTTG CGCGAGAGTT TTTTGCATGC GACAGGTTTG ATACAGCATC 8280 

CGGAGTGGGA AGATACGTTT TATCACTGCG AGCAACGCTC TTATGAGCCA GCTGCTGCCC 8340 

TGGCGGAGTC TGTGCCTTCT CCTGTCTTAC AGGAGGCCGT GGCCGTTTyC CCTCCGGGGG 8400 

TGGCGGTTAA CGATTCTGTT GCGGAAAGAA AAACAACTGC TCCCGCACGT GTCTTTTCGC 8460 

GTACAGCGCT TCGTATTTTG ATGTGTGGAG ATTCCCAAAT GCGTTACCTT ACCGGCGGTG 8520 

CGTTGCAGGT GCTAGGGACG TCTTCGCACG TGCAGATTCA AGAAGTGACG GTTAGTTCTT 8580 

CTGGTTTTGT GCGGACCGAT TATTACCACT GGCCACGAAA ATTTCTCGCG CTCCTGGATA 8640 

CGCACACCCA ACAAGAACCA TATGCAGCGG TAATTATGGC ATTTGGTATG AATGACTATC 8700 

AAAATTTTTA TGATGCGGAT GGCTCTTTGT GTGTGACGAA AACTGCACGC TGGGAACGCG 8760 

CGTATGAGCA AAAAATGCGC GCCTGTTTGA ATATTATTCT GCACACAGTA CCGAAGGTGT 8820 

ATCTGTTGGG TATGCCAGAG ACACGTAACA AACAGTTGAA TGAGAAGCTT GTGTACATCG 8880 

AGCACGTACA AAAGAAAGTA GTGGCGCAAT ACGATCCGcA GCGGGTGCGC TATTACTCCC 8940 

TCAAACCAAT TGTACCCGGT GTACACGGAA CATATGCAAG CGCGATAAGG GACACGCACG 9000 

GCCGTTGGGT ACACGTGATG CACAAAGACG GCATCCACTA CACCATAGAG GGTGGTGCGT 9060 

ACGTTATGGA AACTCTCTTA CCCCTTATTC TTGCAGATTT GGAACGGTCT CGTCACGGAT 9120 

ACATGCGTTC TTCTCTGGGG TCGCATGAAC TCCCTGCGAC GAAGGGGATG GAAAGAGCAC 9180 

GTCACGCGTC AACTCGAACA TAGGGATAAA CCGCACCGTT GTATCCTGCA TGACAGGGGG 9240 

TGCGTCCTGT CAGGGGGTTA TCGTACGTCC ACACCGTTGC CTGCACTTGC ATAGTGCTGT 9300 

TTTGGTCACC TGAGAATGTT ACCGTAAAGG GGAGTGGTGG GCGCGCCTGC GATATGAAnC 9360 

GTACC 9365 
(2) INFORMATION FOR SEQ ID NO: 84: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5019 base pairs 

(B) TYPE: nueleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84: 

AGGGTGTnCn CCGCACGGAA AATCTCnTCT ACAGCnTCCC GATTGGTGGG ATCCTGCTCA 60 
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ATGACGAGTA TATTGCTTTC GAGCTGCTCA ACCTGTTG t C CGCCTCGCTG AAAAAATCCT 120 

TCAGCAACTC TTCGTTGCCC GCGTCGAGGT AATCGCTCAT GGGCACAATC TTCCCGCATT 180 

CTCAGCCTTT GGTCAAGGCA ACGTCCAAAA GACCCCGCCA CCCGCTGCAT CCCTTGACAC 240 

TTCCATCAGC CGAGAATAGA CTCCGCCATG GCCCGCGCGC AGGTACGTTT GTCAGGTCGG 300 
TTTCTCGCCC TCGGTGGTTA CTCCTTGTAT GCCGCGGGAC TGGTCCTACT CCTCTACCGA 360 
GCGGTGGGGG TCTACGTTTG GATCCACTTT TATCAAAACC AGACCCTCTC GCAGCTTTCT 420 
ACCCACGCGC GGTCGTCTTG GGTATCGATT GGCGGACACG CaGCCTCTTC CTCGGCGCGG 480 
TGCTCTTGAC CGTTCGGGAC GTGCGCACCT TCAGAGACGA CAGCACGATG TGGATAGCCC 540 
CCGTCGTTCT CTGGTCCGCG GCGACCTACT CCAGCATGGG aGmCTGCGGC TTCTCTCTCA 600 
GGGCCGTAGC CAGCGGCGCA TGCGGCGTGC TTCTCTTCGC CGTTTGGTCG CTCGCGTcAG 660 
AGCTTGGGCG CCGCCGCCCT TAGCGGCGCG TTCGCCCGmG GCTGAGCCTT TTTCATAAGA 720 
GGGTTGcTGC GTAGGCAGCG CGCGCAAATC CGCATGTTCA GCGCAGAGCC ATCTAcAACT 780 
ACTcTCAtGC GAgCArGTTC GGTTTCCACA CCCGCTTGCC AGTGGTGCAT CGATTTACTC 840 
ACGGCGCACC CGCTGATCGT CCCCTTTCCA CACAGTCCAC ATCTTCTTGC caTCGkwmAT 900 
GTCTCCGCCG CCGATGATAA AGAAAAGGTC CGCCAGAGTA AAGGCGgCAC GCGCGCTTTA 960 

TGGCATAATG CTCCCCTGTG CGGTGGTCAA CCAACGCAGA AGGAGCGCCC TCATGTGCGG 1020 

AATTCTCTCC ATCGTCGTAA GTATCGCTGT TCTTGCACCG CTGGCAGTTG CAAACACCCT 1080 

cTGtACGCAC TGTACCGCCC GGGATGCAGA TGGGTTAATC ACGCGGTATC CACCTGGGGT 1140 

GCACGGACTA TCTTTGCCGT TCTACATACt ACGGAAGACT TAAGTTTTAC AGCGACTACA 1200 

CCCACGCTGA GCACCTGCCC ACGCAgTACC TTGTAGTCTC AAATCATCAA AGTGTGCTAG 12 60 

ACACCCCTGC ACTCATGCGT TACTTTGGcT AT AT t GATGC GCCGAGGTTG CGCTTCGTAG 1320 

C AAAGC GAGA GCTCGCGCGC TTCGTCCCCC TTGTGTCCAC CATGCTCAAA AGC GGTCGGC 1380 

ACTGCCTGGT AGACCGGCAT CAGTCAGGAG CACAGGCGAT GCAAGCGCTC GAAACGTTTA 1440 

CACACCACGT CATGCGAGAg CGATGGCTCC CTGTGATCTT CCCTGAAGGG ACACGATCAA 1500 

GAAACGGCGC CCTGCgCATC TTCCACGCCG CAGTTtCCGT GCGCTCACGG CACGGCTCCC 1560 

ACTCCCCGTA GTTGTATGCG CGTTGGACGG AGGGTGGAAC cTACGCCCGT TGCTGAAAAT 1620 

AGGACAGAAG CTTAAAGGAA GCGCTTACCG CGTCAAAGTG CTCGGCGTTT ACCCCGCACC 1680 

CCACAACAAA GACGAACAGC TCTCCCTCCT ACGCGAGGCA AAAGCGCTCA TTCAAGCGCA 1740 

gcTAGACGTC TGGCGCTCGG AGACCACCTG TCCTCAAGAC GAGTGCCTCA CGCGCGCCTC 1800 
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CAAAGACTCC TAGTTAAAGT CACGCGGCCG ACGCTCAACA CGCCTGCACT TAGGACACTC 
AGCGATGTTC TTCACCTTTC CGTTATCAAC AATAGACCTC AGCACCACCT CTCCCCCGCA 
CGGACAGTTA AAGCGCTGCG TCGCCGCCGC AGTACGCGAA TTTTTGCTGG CACCAGCCAT 
TTTCATACCT CCTAGTAGCT CGCTCGCCAG GCTGCATCAC ATTGTCCTTT TGGTCAAGCG 
TATGGCGAAT TCTAGGGGGA AGACACGACA CTCGCCCCGC CTACCGGCGg TACGTACTCT 
CTGCATCGCG ATATAGAACC GTGGAATGAG AAAAAATGGC CTCATATTGA CAGAGATTTT 
TCCTGTTCCC ATAATGGCAC CCTGGGTGGG GTAGGCGTGG CGCTCACTAA GTCGTTTTTG 
GAGTCAAGAT CAACCGGGGA GCTGTTTGCG CTTGCAGATG AGCTCGGTCT TTGCTTGCCT 
GAGGATCTTA ATCGAAGACT TGTCATTGGC GAGATCCTTG ATTGTTACCA CAGCGCTCTT 
GATTTGAACC CTCCGTGCGC TCCCCAGTCC CTTGAATCAA AGGGGACTTC GTGTGCCTAC 
AATACCACCG AAATCCATAT CCTTGCCCGT GACCCGCTTT GGTTCTTTGT CTTTTGGGAT 
ATCCACGAGC AACTCTTTTG CACACTCACC CAGAGTCCTC AATTTAGGTC GTTCTTTCTG 
CGCGTGCACT CCCTCGGTGG TCATGGCTGG CACACCTCGC TCGACCACTT CGATATTGAT 
GTACCCCTCA AAGACAGAAA GCGTTACGTG CACCTGTCCT TGGCCGACGA TGCTAATCGC 
ATAGATCTTT GCTGCAAAAT GCTCCAACGC GAACGCATCC TTGCTCAATC CAGAgTTGTC 
ACGCTCCAGC GCAgCkTCAT AGAACGGAGT CTTwACCCCg AGGATCCAAC CGGCGCAGAA 
GTTCTCAGCC TCTGTGGGCT TCCCCTGCTT GAGGAAACCT ACCCAAGCAC GTCTCTTCCT 
GTGTGCTC AT AAGTGGGACC TTCATGAAGA AAATGCCaGT GCACTCTCTT GCATTTGTTC 
TCGATTGTAA TCTTCCcTTC GTCCGAGGGG CCGGCGCATC TTCTCTCCTC GCTGAATCCC 
GTTTTTTTCT CGAGATTTCC TATACCTACC TCCCCCTACT CCGCTTATGC GAAACACTCG 
AACGTGAGCG TGTTCCTTTT AACATCTCCC TCGCTATCGG GCCCGTTCTG TGCGAAATGC 
TCGCTAACCG CGTGCTTATG GACCGATACC GGCGTGCACT CGACGCACTC ATCGAATTCG 
GAGAACGGGA AGCCATTCGC CTGAGGAACA GTCTCCAAGA GCGCGTGCAA GCTGAAGCAG 
TGCTTCGGTC TCTTCGCTCT CACCGGGATT ATTTTGa TC A CTGmGATGGG GCACTCCTTG 
AACGcATCAA TCACTTTTTC CGcACAGGTT CCATTGAATT ACTCGcAACA ACGGCAGTTA 
ATTGTTTCTT ACCCTTCTAC CAAGaCATGc CCGAATCTAT ATCCGcCCAA ATCGAAATGG 
GGCTTATTAA TTACCGcAAA CATTTTTCCT CAATTCCCCG CGGTTTTTAT TTACCTGAAC 
TTGGCTATGC ACCAGCGCTT GAGCGCACTA TAAAATGATA CGGATTCTCG TACACCATAT 
TGGAAACACA TAGTTTCCgT TTGGcACTCG CGTACCCCGA CGTGGcATCT TgAGCCkGCA 
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CAGACGTCCA AgGCTTGTGG TGCTTAGGAA AGGAsCGTGT CGCCACkGCA G AAGTGC A t G 
GCGCCACgCA TTCCTTCTGT ACACAGGCGG TGTACGGAAA TACAGAACAA GATGCTGGGT 
TTATACTGCC TGAGGAGGCT CTGTACCCTT TATTTGAACC ACACAAGGAA CGCATGGCTA 
CTGGGTATCT GTACCAGGCC CGTTCAGGCA CGCCGTATGA GCAAGAAAAA GCGCAACGCA 
CGTGTGTCGC TGATGCGCGG GCGTTCGTGC GTAATCGGAC AGAAATATTT GAAAAAGTAG 
TCCACGCAAC CGCTCCCTTC GAGGCTATGT CGGTGTGCGT ATTCCCTGCA TCACTATTTG 
GAG TTGCATG GGCAGAGGGG ATGGATTGGC TTGAAGCCGT ATTTCGCACC GTTGCAGAAA 
GCGCGCAAAT GCGnGTgTCC TGTACGGCGG CGCTCACCTG CCCCGCAgTG GGGGTGTCAA 
TCATTGAACC CTTTTTTGGA TCTTCACTGG GGGGAGGTTA TGCGGATGAG CTTATTAATA 
GCGCAAACGA CTGGATGCAT CCTGCAATAC AAAAAACCAC AGAACGCATG ATCGACCTCA 
CAGAGCGCTT TGCACACGAC ACCGGCTTTC GCGAACGCCT ACTGAACATG GCAGCACGTG 
ATGTGCTTTT GTGTCAGTCG CTGTTCTGGC CCCTTTTAGG GAACCATTGT CGCTACCCCG 
AGTACGCCGC TAGCGAGTGC GCCGACCACC TCAAGGCCTT TACGAGGGTA TACGAGGCGC 
TCGGCTCCGG AGAGGTAAGT GCACAGTGGC TAATGCGGCG CGAACGCGAG CTACCACTGT 
TCTCTGAAAT TAACTTTCGC TTTTTCAGTA AAAAGAAATG ACACCTCACC AAAAGGCGAT 
CTCCCCAGAG TAATACGATC GCATCGACCC AGCGCTATCT ATGCACAGCG CGCCGTCCGC 
GGCCAATCCG ACAACACGAC CCAGAATCGG AGGACGTGTT CCCGCACATT CGCGGAAACA 
GACGTACTCA CCCTGCTTCC ACAGACACGA CTCAAGCACG CCGATACTCG GcGGCGCCAT 
GaCACAGGCA TACAGCCGAT CCAACAGCAC AGGAAGGAAA GCAAACGGAT CAGGGCACCG 
CTCGCCTCCG ACAATCTGCG CGAGAGAGCA GGCATGCGAC AACTCGGGCG GAAACTTCAC 
CTGcAGGAGG TTACACCCAA TACCCACGAG AArCGCTCCC GCGCGcAcCT GGCAGAGCAC 
CCCGGnGATC TTACGATCGC ACACCAGGAC GTCGTTGGGC CACTTAaTGc GCGGTGCGCA 
CACACCTCCG AGGAAGGCCA TGTATGCAAG AGCGACCGCA TATCCAACAC AGAGCGAAAA 
CGCAGGGAAA GCAACACGCC GCAAAACGAC AGTACACAAA AGATTCTTTC CCGGCTCCGA 
TTGCCACTTT CTCTGGTCAC CACGAnCACG TCCAGCAAA 
(2) INFORMATION FOR SEQ ID NO: 85: 
(i) SEQUENCE CHARACTERISTICS: 



3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5019 



(A) LENGTH: 3843 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85: 



GTGATCACTC 




TTAGATCCTA 


AGAACGTGGA 


AATTGACGAG 


60 


CACAGCTATG 




AL> LXj 1 


TCTGGGCAGA 


GCGGCAAATT 


120 






Cj l\jt-Av»L. 


AGGATGCGCG 


TGTTACGGTC 


180 


GACGGGGTCA 




gataacgtgt 


ACGCTATT AT 


TTCTAAAATT 


240 


/Itt 1 lv.nv3 


GCGCGGCAGT CCGTGCGTAC 


CTTGACCCGG 


TTACCAACGG 


TTTGAATATG 


300 


uAvxAl. A it Ivj 


ACGCCAGGCA GTTGTGGTTG 


CAAGATGAGG 


ACGAGGCAAA 


CGTGTTTGCC 


360 


TCGTTGGGTT 


TGATTACCGA AGGcAGCGTC 


CGCCGTACAA 


TGTGTCAGCA 


GGTGCACGGG 


420 


TTTCAGGCGG 


GTCCATTTTT GACATGGCCA 


TTGCGGTGCG 


CAATGCCCTG 


CTTGCAGGTG 


480 


ATCAAGAGTC 


GCTCGGGGGT AAGATTTTAG 


GAAGTGCAGA 


TGGTGCAGTT 


GACAACCTTT 


540 


CAcTGCGTCT 


TGCCGAGACA GGAGCGCGGT ACgcACGTGC ACAGGCAACA CTTGCACGTT 


600 


TTAATAGCCA 


CATTCCGAAT GTGGTTGCAG 


CGGAGTCTCG 


TGAGTCTGAT 




660 


CGCAGGCGAT 


AAuijLxAl iTG AAAATGTTTG 


AGTATACGCA 


CCAAGCAACG 


CTGAGTACGG 


720 


TGGGCTCTTT 


GTATAAGCAC ACGCTCTTGG 


ATTATTTGCG 


GTAGGGGAAG 


GAGCCGCACG 


780 


CTATGGAGAT 


TCAGACGAAG ACGCTCGGTA 


CACAAACGGT 


TGAGGCACAC 


CAGATTATTA 


840 


pprnmpp ^/'W^ 
t-t? J. 1\jCjAVjCO 


TGGTCTCTAT GGTTTTGAGA 


AATATCATCG 


CTTTGCGTTA 


TTTGATGCAG 


900 


J. vjCAvjVj 1 1\.L 


GTTTATTCAT ATGCAGTCCT 


TAGACGATCC 


GGCGTTG TCC 


TTCATTGCTA 


960 


mfTv-i 7ATV"*/T" ,r P r P 


TTTGTTTCGT CCGGACTACG 


AATTGGACAT 


TGACGATGTA 


TTGTTGCAGC 


1020 




TTCTTCTCCT ACCGACGTCT 


TGGTGTTCGC 


GTTGGTGACC 


ATTCCTCCCG 


1080 


a Xuun i. \— ± V3v» 


GGTGACTGCA AATTTGCAAG 


GTCCCTTGAT 


TGTGAACAAG 


AAAAACCGCA 


1140 


AGGCGATGCA 


AGTGGCGATG GGTGGCGATC 


GGTGGAGAAC 


GAAGCACGAT 


ATCGTCGCCG 


1200 


AAATGGCAGA 


AAGAAGGGCG CAGGAACAAT 


GTTGATCCTT 


TCGCGCAAGA 


CAAATCAGAA 


1260 


AATCTTTATT 


GGGGACTCGA TTGAACTGAC 


TATTATTGAG 


ATTCGCGGCG 


ATCAGGTAAA 


1320 


AGTCGGTGTG 


GAAGCGCCGC GTTCGGTGAA 


AATATTCCGA 


CAAGAGGTAT 


ACGAAGAGAT 


1380 


CCAGAGAGAG 


AACCGCGCTG CGTCCGACTC 


CCCCTGGTCT 


CCTAACTCAT 


TGCCTCAGTT 


1440 


GCCTGTGTAG 


TTGCAGAGGA TACCCATCCC 


TCGGGGTGGG 


AGTGTTTTGC 


GCGGATGACT 


1500 


GAGTTCACCT 


AAATCGCACC GCGTTCCAGT 


GCGCATTGTA 


GTGTTCGATT 


GCTGCCCCCA 


1560 


CATCTGTTTT 


TAGCGTTCCT CGCTCGAGGT 


CATGCTCGCT 


GTAGTATGGA 


GTCTTCTTCT 


1620 
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GATATGCTGC CGCTTCCCGG TGGATAGTTG AAGGGAATCC AAAGGTGTCC AAAAACTCCG 
CGTAGTGTGC AGGCTCTAGG AAGAAGTTGA TAAACGCATG GGCAAGGTCG CGATTGCGTG 
CCCCCTTGGG AATGCAAAAG CTGTCTACGT ACACTGGGCT GGCGAGATCT TGTGGTATAA 
AGAAGTCTAT ATGTTCGTGC ATTGCCTCAG GAGTCTCTGC AAAGAAGGCC TCTGCAAAGC 
CATGAGCTAC AACAAAGTCT CCCGATGCAA ATGACTTTGC GTATCCGTCC GAATCAAACT 
TTACCAAGTT TGGTTTCCAG TGGTCGGTGA CAAGTATTGC TGCCTGCGCA AgCTCTTGTT 
CGTTTTTTGT GTTTACGTTG TAGCCAAGTG AAGCAAGTGC AGCACCCATT Ac TTCGCGC A 
TATCGTCCAT CATGCTCATA CGATACGCCA GGTCTTTGCG TGAGAAGATA GACCACGTGC 
GCGCGTATGA CGGAACTGCT TTTTTGTTTA CCGCAATGCC TGCCGCTCCA AGATAATACG 
GCACCGAATA TTCCATTTTT GGATCGTAGG CTATGCGAGC ACGGACACTC TCTTTGATAA 
ACTGTACGTT GGGAATCTTG GATAGGTCAA TTTTTTCCAA CAGATGCTTG CGTTTCATGA 
TGCTGACAAA GTCACCCGAA GnGCCACTAA ATCAT AAC CA CTTGCACCAA TGCTCAGTTT 
TGCAAACATA TCTTCATTTG AAGCGTAATC ATCATAGACT ACCTGCACGT TATACTGTTG 
TTCAAACTTT TTAATGAGGG ACGTCGGGGT GTAGTACGTC CAGTTATACA GGTACAGGAC 
ATCCTGTCGT GTCTGCAGGC ATGATCCCAT CCAGAGGGAG AGAAAAAGGA GAGAArGAAT 
GCgCGAACTG CTCACACAAA AACGTTTCAT GCTTTGCTCC TTACGAGATT GTTTGCAGAA 
AAGTAGTTAC TTnGAATGGA. CTATTGTTTT TAnAGAATTG CGCAGGAGGT AGGCGACCCC 
TACAATCCCT GCCATCATGA TCAGGGAAAG GGCATTGATG ATaGGAGAGA CCCCATAGCG 
GATCATTGAA AAC A C AT ACA GGGGGAGTGT GGTGGAGCCC GGTCCTGCAA CGAAAAAGGT 
GATGACAAAA TCTTCCAGAG AAAGGGTTAC TGAAAGTAAA AAGCCAGACA GTATGCCTGG 
CATGATGGCA GGGATCACGA TTTTTCCTAG CGCTTGCCAC TCGTTTGCAC CTAAGTCTTG 
CGCCGCCTCT ATGAGAGAGA GGTCAAAGGT GTCGATGCGA GTAAGGATGA GCAGGAGCAC 
GAAGGGCAGA CAAAAGGTGA TATGAGCGGT GATGnATGTT GCGCGCCCCA GCGGCAGGCG 
TACTAGGGAG AAAAAAACGA GCATTGCCAT ACCTGTGATA AC CTCAGGG A GCAGCATGGG 
CAGAAGGCTC ATTACCTGCG CATATAGCCG GCCTGAGAAA CGATACCAAC GAATTGCGAT 
GGCAGCGGCA GTCCCCACAA TTGTTGCTAC AAGTGCAGAA ACAGATGCTA TAAGCACGCT 
ATTAAGAAAG GAAGACCACA GTTTTTCTGA ATAAAAGAAT AGCTCTGTGT ACCAGCGCAG 
CGAGAAACCG GTCCAGATAA GGGATTTATC CTTGTTGAAG GAAAAAAGCG CAATAACTGC 
AAGCGGCAGA AATAGGAACG AGACAACTGC CGCCAGCAGC ACAGCAGAGA AGGAACACCG 



1680 
1740 
1800 
I860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
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TAGGGGAgTG CGTGCGCGCG TGTGGATCAG ACGGCAATTA GGCATATGAG CGGCGTCCTC 
CGCAAGGGGG AGGTGTTGTA TTGCAGGCAC CGCTTTCTGC AAGTATCCAG AGTACTCCTA 
CCCCCCCTGC GAGGGTAACG AGCATCGCGA AGGCCGAAGC GAGTGGCCAA TTTCCCACGA 
TACGTACCTG GTCCACAATT GCGTTTCCGA TAAGGTAGGA ATCCTTTCCT CCCACCAGGA 
GGGGGACGGT GTAGGAACCG AAGACAGGAA TGAAGGTAAA GAACACGGCG GTGGCAATGC 
CGGTTTTGAT GTTGGGAAGC AACACGCGGA TAATGGCACC CGTGGGGGTA GAGCCTAGAT 
CGCATGCAGA TTCAAGGAGG GAGAAATCGA AGCGATCGAT AGCGGCGAAA ATAGGAAAGA 
TAGCGTAGGG CAGGAACATA TAGGTGAGCA CCACAATGAC TGCCCCGTTA TGGTACAGGA 
GCG 

(2) INFORMATION FOR SEQ ID NO: 86: 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 2072 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86: 
CnTCATCCGG CCCCAGAGGG AAAAAACGCG CGCCGCGTTT CCCGTTGTTT CAGGCCCACC 
AGCnCGGTGC TCCACGTTCC AGTGTACCCG GTAGTGCGCA CGTACTATGC GCGCATGAGA 
GATTGGGTGA TCCGTGCTGT GCGCGCTCAT CTTTTTTGCC AGAAGGAGTG CGCATGGCCG 
TGCACACCCG CCGGCACAGC ACCACGCAGG CGGAAAGCTC ACTGCAGCAG ATAAgGGGGG 
AGCATGTTTG AGCAACTGAG CGCGAgcTTT AAACGTATCG TAGGAGCGCT GGGAGGACGC 
GCAACTATCA GCGAACGGAA TATTCAAGAA GCAGTAGAAT CAATTAAGCG CGCACTGTTA 
GACGCCGACG TGCACGTTCG CGTAGTGCGT CGCTTTGTGA ATCAGACCAT ACAACACGCG 
CAGGaCAGAC GGTGCTTGCG TCGGTTAGCC CTGCGCAgcA ATTCATAAAA ATTGTACACG 
AAAGACTAAC TGCCTTCCTC GGTGAACATA CGCGGTCGCT GCATCTTAAG GGGCCCGATA 
CGCAnTCGAT TATTCTCTTG CTTGGGCTCC AAGGATCGGG GAAAACTACC AGCGCAgCAA 
AGCTTGCTGC GTACCTGAAG GATGCAGGTC GCTCCCCTCT CCTGGCCGCT TGCGATCACG 
TTCGTGCCGC AGCGAGTGCT CAGCTGGCCG TTCTCGGCAC GCACATTGGC GTTCCCGTGT 
ACCAGCATGC GcTGCCGCAC GAACAGCAGC CGTGTGCTCT TGATACgGcG CGCGGTGCGc 
yTTCAGTACG CGCGCTCACA CGGCAATGAC GTACTTATCA TTGACACTGC TGGCCGTCTC 



3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3843 
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180 
240 
300 
360 
420 
480 
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600 
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720 
780 
840 
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CACGTGGATG CCGCGCTCAT GCAGGAGTTA ATCCTTCTCA AAGAAACACT GGTTCCTGTG 900 



GAAACACTCC TTGTTGCAGA CGCTCTAACC GGTCAGACTG TGGTGCGCAT TGCAGAAGAG 960 

TTCCATGCCG CGGTGGGTAT TTCAGGCGTT GTGCTCAGTA AGTTTGATTC AGACACCCGC 1020 

GGCGGAGCTG CACTGTCTTT GAAAAGTATT ACCGGTCAGC CACTGCTGTT TGTTGGAACC 1080 

GGTGAACGAC CGCAGnACTT TGAACCGTTC CATCCCGAGC GAGCCGCCGG AAGAATTCTG 1140 

GGTATGGGGG ACATCGTTTC TCTCGTGGAA AAGGCGCAAA AAGCCTTTGA TGCACAGGAA 1200 

CATGCGCGTG CGCAGAAGAA AACGCAATCG CACGAGCGTT TCACGCTCAG CGATATGCTC 1260 

GACCACCTGC AAACTATAGA AAAAATGGGA CCGCTGCACT CGTTGGTGGA GATGATTCCC 1320 

GGTTTAGCGG TAGCCGTTTC TGCCGATGCT CTTGACGCGC GCGCGTTCAA GCGTCAAAAG 1380 

GCGATTATTC AATCGATGAC CGTGCAAGAG CGTGACAATT TTCTCATTAT CGGCCCCTCA 1440 

AGGAGGCGGC GCATCGCGGC AGGGTCAGGC ACTTCGGTGG CTGATGTTAA CCGTTTAATT 1500 

AAGAATTTCG AGAGGATGCG CACGCTCATG CGCAAGACTG CATGGCAGTC ACGCCGGGCA 1560 

CACCCTAAAG GAGATACACC CTATGGATGG CCACATCGCT AGATACGCCG CCTGCGCCCT 1620 

TTTCGCACTC AGTGCGCTGT GCsCCCCgcT TCGCGCGCAG AAGACGCGGA CCACTCCTGG 1680 

TCCCCCGTAC ATACCCCACC CTACGATCCT ACACTTTTCC AGTCGGACGC GCAGcGTGCC 1740 

GCCTTCCACA CCTTGGCGGC GGAACACCTT TCCTTCCTTA CCGGTCACAT GTGCTTCTTC 1800 

CGTCCTATCC CTACCCGCGA TCCTTTCCTC ACCCGTGCCT ACGAAATCTC CCCACATCCC 1860 

CGCACACAGA AACCCACCGT GCTGCTCGCC TTTGACTCGG ATATCATCTA CCTTCTTTTC 1920 

TACGATCACC GACCAACAGA TTTCCCCGCC CTCCGCTTCT TTCAAAACGC ACCTACTTTC 1980 

CAAGAACTTC CGAGCACCTT CTACCCCTAC ATTGCCATGC ACAGCGACGC CGTTCTCGTG 2040 

CGACATCCAA CGCCGCGCCC CCCTACCCTT CC 2072 
(2) INFORMATION FOR SEQ ID NO: 87: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3288 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: 

ATGnTATGAT CACATTCATC ACATCTCCTA CATTTTCTTG CGCACACGAA AGCTCCACGC 60 

TCATGATAGG CTCAAGCACG TATGGCTCAG CGGCTACACA CGCCTCACCA AATGCTTGCA 120 
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CCGCTGCAGC TTCAAATATA AAAGGCGAAG ACGTCAGTTC CTGATACTCC ACCGACAGGA 
GATGCACCCC CACGTCTACA CACGG ATAC C CACTCTGAAT ACCTCCATCC CACGCGCCGc 
GGATAGCATG CTCTACCGCT CCGATAATTT CTGCCGGAGC AGTATGCGCC GTGCACACTG 
TGCCCCGCAA CTCTTTCACC TGACAAAAAA ACTCATTTCC CGCACCAcgC TTCCGCGCCT 
CCACACGCAA GgTTAACCCT GC CAT AT ATT CTTTGCCACC GATTACTCGC TGTACTCGCA 
GCGTACGCTC AACTGTTTTA CGAATAGATT CACGGTACGT CACGTGCGGC TTACCAACAC 
GTACCTGCAC ATTAAAATCC TCGCGCATCC GGGTGGTGAG CACATCAAGA TGCAACTCCC 
CCATACCGGA AATTAAAAGC TGCCCTGTCT CTGCATCTTC CCGAACCGAG AAAGTTGGAT 
CCTCTCGCGA AAGAATACCA AGTGTCTCCT GTAACTTATC GCGCGATGAT GCGTCCATTG 
GCTCAAGCGA CACAGAAATA ACCGGTTCAG GAAAATGCAT AGACTCCAGC ACTACCGGAC 
ACGAACCATC CCCCACGCTG TCCCCTGTTT GTGCAGATTT TAGTCCCACA ATTACCGCAA 
TATCCCCCGC CTGTATGCAT TCAACGGTTT CAGACTTATT CGAATGCATA CGCAAAATGC 
GATACACCCG TTCACGTTTC TTTTTCCCAA TGTTAACGAT ACTGTCCCCC GTACGCAGTT 
TTCCCGAATA CATGCGCACG TAGCAGAGTA AACCcGCTTC ACGTTCGTAC TGAATTTTAA 
ATACAAGTGC CAGCAACGGT CCTTCAGCAG TAGGAGCGAT AAAAACAGGC TCCTTTTTCT 
GTACGTGAAA ACCTTCTACT GCTTTacGCT cgCGGCGCAG GCAAtACTCT AmCACTGCAT 
CGAGC aGTGG TtGCACACCC AAGTTATGAC GAGAAGAACC GCACAAAAAA GGAACATATC 
GwCCGtCGCG CACAGCCTTT CTAATTTCTG msTGCAgTAA CTGCACTGGA ACGTGCTCCC 
CTGCAAGCAC ACACTCGGTT ACCTCATCCG AATATATGGA AATGACATCA AGCATTTTTT 
CTcGCGCttC gGcCTGGGCA ATACGTGCGC TTTGAATAGG CCGGTACTCC ATCTGTTCGC 
CACTACTTGC CGCATCCCAG AAAATCTCTT TCATGGTGAT CAAATCAATT ACCCCCTCAA 
AAGAGGTACC AGAACCAATG GGTATCTGCA ACGCGACTGC ATCTAT AC CG AATTTATTGT 
GGACTTGGTC CAATACTGAG AAAAAGTCAG CACCGATCCG ATCCATTTTG TTAACAAAAC 
AAACACGCGG GATATCATAA CGATCTGCCT GGATACCATA CGGTTTCTGT CTGTGGGCTG 
TACTCTTCCT ACCGGCACAC AATACCACCA CTACCCCATC TAACACGGGG CAACGCAnTT 
CGACTTCTGC AGTAAAATCT ACATGtCCCG GcGTAcAATA AtGGTAATGT CTACTTCACG 
CCACCGCACC GTCGTTGCAG CACTCTGAAT GGTGATACCG CGTTCCTGTT CCTGTACCAT 
CCAATCCATC GTTGTCGCAC CATCATCAAT TTCCCCCATG CGGTGGATCT TGCCCGTGTA 
AAAAAGCATA CGTTCAGTGG TAGTAGTCTT ACCAGCGTCA ACATGTGCCA TAATGCCAAT 



180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
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1680 
1740 
1800 
1860 
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ATTCCTCATC TGCTGTTGTC TCATATCTTC CTCTTTATCA CGTACCTGTA CGGATAAAAA 
ACAGTGCACA GGACTACTCC CCACCATCAT AGCAGCGGGA AAGAGAATAC CTAAAACGTA 
AAACAGTGCA CGCTACACCG CACGCGCATT CAGTGACAAA TCACAAATCG TCTGTGGAAA 
CAAAATCTGc AATTAAATGG GGTGGTACCG AATCTTTCAG TTTCCTTACA ATAGACTGCA 
GCACACGCAC ATCTTCGGCT TCACTTAACA GAGACTTTTT TTGACGCAAA AATTCAACCG 
CCTGAATATA TTTTTTCGCG TGCACCAATA CGTTTGCATA CTCAAGAATA CGTTGAGTAT 
TTTTCTGATC ACGCTCATAC AACTGTTGAT AAAGAAGTAG CGCGCGAGCT GTTTCACCAA 
AAGAAAGCaA cCATAnCGTA CGCGGCAGCA ATGGTTGCAT TGGCAGAATC CGCATCGTAT 
AACGGTTGCA GTGCATGAAC AGCaGTTTTC CAATCATTCT GCAATCCACA CACACGCGCA 
AAATTATACT GCGC TGCATG ATAGTACGCA GGGTCACGCG CTGCACGCAT ATAATAATCA 
CGCGCCTCGT CATAACGATG CAGCTGCACA TACGCATGGG CAAGATCAAA ATACTCTGGC 
GCAAGAGACG ACTCAGGCTT TCCGCATGCA AATAGTCCTG TATAAAACAG GTGAAACAAA 
AAAAAGCAAA AACACTGCTT ACCGTTCATC AGTCAGCGAG AACCTAATAC CGTTTCCTCT 
ATCTTGCGGA GCTGGGACCG GTACAGGCCA GAAATATTTG AACCATAGTC TGCCAAAAGC 
TGGATAATAT TCCGCGGATA GTCTTCAACA TCAACCCCGT TAACTCGATC ACCAGCATCG 
CCAAGACCAG GCATAATATA CGCACGCGCG TTGAGTACGG GATCCATCCA CAGCGTATAC 
ACCGTGCAAT TCTCTAGGGA ACGCACTACA CGAATCGCAC CTTTCAGTGC AGAAATCATG 
TGAAAACAGC TGATAGATTT TGGTTTCACA CCGAGATCTT GCAAATAACG CACTATGGTA 
ACCAAACTAC CACCGGTGGC GTTCATGGGA TCGGCGAAAA CCAGATCCTT ACCATCCAAC 
TCTCGTGCAG AAAAGTATGA TTTATCCAGA TCAAACACAT ACTG cATATC GCGCTCATCA 
CGGAGaTCAT CTCGCTTGaT TTTAAAAAGC GCAAACGGcG TTACGTACCC ATGc GAAGAA 
TATTCTTCTA TCTCCTTAGA AACAATCATC GAaGG TAAC A GCGCTCCTCG TAACATGACA 
CACATCACCG TGTTCTCAAT TTTATAATCC ACGTTTGCAA TTTTATGTAC TGCAT AG TTT 
TGTACAGGAA AAGCAACCGG TGTTTTTGTA ATAAGATATG TTTTATGC 
(2) INFORMATION FOR SEQ ID NO: 88: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4238 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88: 
AGCCGTTGCG TCCGnTGGGA GTGGATTCAG ACTGTGGCGn CAGTGGAGGT GGTGGAGCAC 
GGGGAGCATC CGTGGTTTTT CGGCGTGCCA GTTCCACCCG GAATTTTGTT CTCGACCGAA 
CCGGGCTCAC CCTTTGTTCC GGGCGCTGGT GGCGGCGGGC TTGGAGCGAA AAGACAGTCG 
TTCTTGACGC GCGCAGGTAT TTTTTCTAGA CTCCTCTGTC CTTCTCAGCC GGTTGAAAGA 
GCTTTGTGTG GGCGTGCGGG TTTTGTTTGC GCGCGCTCCG ATGCGGTTGC AGGTTTGAGG 
GAGTGGATAT CCGTTTTGTG GGGAGGGATG CGGCATGCAG GGAGTTATGC GATGGCAGTG 
AAGCGGGCAC GGGGAAAGAT TGTACGTCGG CTGGGGATTA ACATTTTTGG GAATCCAAAG 
TACACGCGGC TTTTGGGCAA GAAGCCCGCG CCTCCAGGCA AGGAGCATGG GGTGAAGCAG 
CGGGCAAAGG TGTC TGTGT A CGGGGAGCAG CTAAAGGAAA AGCAGAAGTT TCGCTTTGcg 
TATGGAATGT CCGAGCGACA ATTCCGTAAT CTTTTTGCTC AGGCACATCG GATGAAGGGC 
GTGACGGGTA ACAATATGcT GTCGCTAATG GAGCGGCGGC TGGATAACAC GGTGTTCAGG 
ATGGGCTTTG CGATCAGTCG TGTGCAGGCG CGGCAGATGG TGTCACATCG TTACTTCCTT 
ATCAATGGGA AGACGGCCAA TATCCCTTcC ATGCGCATTA GCGCGCATGA TGTCATCACT 
ACTAAGAACC GGAAAGGTAT TCATAGCATC ATTCGTCACA ACCTGACCCT TTCTCAGGGG 
CAGCGCGGTT CC TGGCT AAA CGTGGATGAG GAGCAGCTTT CGGCAACTGT CTCTGAGCTG 
CCGCGTGCGC AGGATATCCA TCCGGTGGGG AATATCCAAC AT ATCG TGG A GTACTACTCG 
CGGTAGGATC CTTTGCCACT TTAGCTGGCG TTGCTCAATT ATCTC C CAAG TCTTACCAGG 
GACTTTGGGG CGTGGAAGGA TGGCGGCGGT GTATGGATGA TGGTCCCTTG AGGGTTGTTG 
TGCTTACCTC ATTTGTCATA CTCGTAGTAG TCTGTGCCGT TGCGCTGTGT ACTTTTTTTG 
TGTTCCTCAA AAGCCCTGAT CAGGTGATGA CTCCCCATAT CGTGGGCAAG GACTTTGTGT 
CTGCTGCTAT AGAGATGCAG GCAAAGGAGC TGTATCCCCG CGTTCAGTTG CGGTTTTCTA 
CCCGTGAGAA GCCTGGTGTT GTTCTTGAAC AGAACCCACC TGCGGGGGCC ATCGTCAAGG 
CTGGGCGCTA CGTGGACCTC GTAGTGAGCC AACAAGCAGT GACTACGCAC GTTGAGGACT 
ATCGGGGATT GCAGGTTGAA GAAGCGGTGG CGCGCATCGC TGCTGCTGAA GTTGAGCGCC 
GCATCTCAGT GAAAACACCC CACTTATATC GGTTCAGCAC TGGCGCAgCT GGcACCATTT 
TGGAGCAGGA CCCTGCTCCT GGCGCGGTTC TGTCTGCGGA TGTAGAGTTG CGTTTTGTCG 
TCAGTAAGGG GTCTGAGCGC GAGCAGACTA CAGTCCCCCT ATTGGTAGGA TATAGTTTGC 
CTGAGCTGTA CCGTGTTATG GCGCAGACGG CGCTCACCTT GCAGTTTACC GTATCTCCCC 
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CGTCTCCTTC TGGGGAGAGA AAAGACGGAG AAGCACGTGG AAGAACGCGT GCCAATGCGC 1740 

AGGACTACGC GCGGGTTTCA GCACAGGATC ATGACCCTGG TTCGCGCGTT GAAGCCTTTC 1800 

GCGCCATGCA GGTGCAGGTG CTCTTTCCAG AGCGTGGAGA GGCTCACGAA ATATACGGTA 1860 

TCTTAGCTCT CGATCTGCCG CGTTATCCGT ATCCTATGTC CTGTGTGTTG GATGTACAGT 192 0 

ATCCAGGGGG GGTGCGTACC GCGCTTGCAA TGTTTCAGCA TCCGGGGGGA CGTTTCACCA 1980 

TCCCCTATGG ATTGCCTGCA GGGGCGACGC TCTTCCTAAC GGTGGGGGGG AAGGAATTGT 2040 

TTTCTGGAGA GGTGGGTGCA TTGCCTCATG CAGGTTCCTA GCAGACGTGA TGGAGCACTG 2100 

CGGGTGCAAA GGTGCGATGG CTGCGTGGTC TGGCGCAgcG TGTGTTGTGC ACTGC TGGTG 2160 

GCGCTTTTGT GTCTTGCCGT CGGCTGCGAT TCCCCTGATT TGCTCGTAGA TAGCGATCTG 2220 

TCTCTTTCGC GCGTGCGCgT GGCAAAAACG CTGGTTATGG GAGTGAGCGA TCGTACGCCG 2280 

CCGATGTGTT TTCGCTATCC GAATGGGGAG AtTGTTGGTT TTGATGTTGA TCTTGCGCGC 2340 

GCGGTCTGTC GTGTATTGGG GGTACGCCTT ATCATTCGTC CCATAAAGTG GACGCTGAAA 2400 

AGGAATg cGC TGCGCTGTGG TCTTGTCGAT TGCGTTTGGa CGGCGTTTGC CGTAaCGcTC 2460 

GGCGCCGCAC TGAGTTTTTA CTTTCCGAGC CATATCTGCG TACTGCGCAG GTACTCCTTG 2520 

TGCGTGAgGG CAGGTTGCAT CCGGATTTGG CACACGTGGA ACGGGAATTG GGGCAGCGTA 2580 

TGACAGGTGT TTCGGCTGTG CATACGCGCG GTGATATTTT GCCTATGCGC TCGTCGCATA 2640 

GACAGGCTCG CATCGCCGTG TTGCGCGGTG GTcCGGTACC GGGAAATGAG AAGTGGCAGT 2700 

TTGGATTTGA ACCACACGGG AAGGTTGTGT GGTACCGACA CCGGAGTGCC ATGCTTGrAG 27 60 

GCnTGCGCAC CCGGGCGGTG GACGCGGCAC TTGTGGATCT GGTTGAGGCT CATGACGCAG 2 820 

TGCATCGTCA GGGTGCGCCT CTGAGGGTGA TGCGGGTACC GCTTGGGTTG AGCCAGTATG 2 880 

CGGTTGCATT TCGGCGTGAG GATCGTGCGT TGCGTGACGA AATTCAGCGA ATCTTGTATC 2940 

GTATTGCTGC CTCCGGTGAG GCATACCGTA TTGCAGAAAA ATGGTTTGGT GTTGGTCAGT 3000 

CGGTTATTGG GATAGAATAA AGGTGCAAGG CAGCGGTGCG TTTTTTGCAC TGCGCTTTTT 3060 

TTGCGTGTGT GCAGGGGTGG GGGGGATGCT CTGGTCGTGT ACTCCTCGTG CAAGGGTGTT 3120 

TCACGCGCAG GATGCGTCGT TCGATGAGGC GCGCGTGCGG GGTACACTTG TGGTGGGCGT 3180 

CGGTCGGGGC TTGGCACCCT TGGTGGATGC TGCCACTTTC TCTGCCTTCT CTCTTCCTTC 3240 

TTCGGTTGTT CCTCCTCCTG CGCGGTGTTC GTTGCTTTTG CAGGAGGCGC GCGGCTACGA 3300 

TGTTGAGCTG TTAGCTCAAG TGGCACGTCG TCTCCATATG GACGTGCAGG TGAAAGTCGT 3360 

TCATTGGGAT GAAAAGGAGC GCGCCCTCCA TGCGGGGGTA ATTGACTGTA TCGCAGACGG 3420 
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ATTCACCTAT AC TGC AGATC ACGCGCGCAG ATTTGCACTG ACGCAGCCGT ACGTACGCGA 34 80 



TGTGCGCGTC TTTGCGGTGT TGCGCCAAGC CCCGTACGCA ACGGTTGCAG ACCTGCATGG 3540 

AAAGCGGCTC GGGGTCCACG CAtgACCGAT GTGGAAGAAA ATGATGCATA CCACGCGTTG 3 600 

TTTGGGCAGG TGAAAACGTA TGCCCACTAT GTTCAGGCAC TCACTGCTTT GTCGCGAGCG 3660 

GAAGTAGATG TGGTGGCGCT GAATTTGGTG ACGCTCTGCG CAGTGACGCC GCACCTGCGG -3720 

gCTTGTATCG AATTTTGGAT GAACCGATAG ACACGTGTGA ATACGTGTTT GCGTTTCGTG 3780 

CGGATGCGCG TGCTTTGCGC GACATGGTTG TGCGCACTCT GTCGCAGCTG CAGCGAGAGG 3840 

GTTTTGTGTC AGCGCTCTCA AAGCGGTGGT TTGGCAGCGA TATGTCCATC ATCGACCGCT 3900 

AAGGCGGGTG GAGGGGGAAT ATCGGTGGAT CCGTTGAATG CCGTTATTGT GGAGGGAAAT 3960 

GTCGTTCCAT CTGCTTCCGC GCGCGTGCCG GAGGgGCCGT GTGTGCGTTT TGCATTCAAA 4020 

CGCAACGGCG CGTGCAAGGG GAGGGGAGGG TGC AC AC AG A GGTTTCGTAT TTTGAAGTTG 4080 

AGGCATGGGA TGCACTTGCG CGCGTGTGTG CGCAACAGGT GCGGCCAGGA GTGGGGTTGC 4140 

GGGTGGTCGG CCGTCTCAAG CArGATCGTT GGCAGCAGGA GGACGGGGTG CGAGTGCAGC 4200 

GGGTAAAGAT TGTCGCTGAG CATGTAGAGT TTCAGACT 4238 
(2) INFORMATION FOR SEQ ID NO: 89: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12411 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89: 

TTCCGTTTCG GGATTGAAGA TTTCAATAGG TTTTCCTGAA TCGTTACCGG ACCACAGTGC 60 

CTTTTCCTTG CGTTGTGTAT TGTGTGCGAT AACGCCATTT GCGGCAGCCA AAATAGTTCC 120 

GGTGGAACGG TAATTTTGCT CTAAGGTAAT TTCTGTGGCG TAGGGGAAGT CTTTTTCAAA 180 

AGAGAGAATA TTGTCGTGGT GTGCTCCACG CCAAGAATAA ATTGATTGAT CATCGTCCCC 240 

TACTACACAG ATATTTTGTG TAGCGAGCAT TTTCATTAAA CGGTACTGCT GTGCACTGGT 300 

GTCTtGGAAT TCATCCACTA AAATGTAATG ATAGCGACTC TTATATGAAG CAAGGATATC 360 

AGGATATTCA CTGAAGATCT GGATtGGCAG TACGATCAAA TCGTCAAAGT CTACCGCATT 420 

AAATAATTTC AACGCAgTAT GATATTCGTG CCAAAGAGCA CGTTCCTTGT GCTGTAGCTC 480 

TTTTAAGTTC TTACGCTGCA TTTTAATAGC GGAAAAGAGC GTgcTCACGC AGTTTGTGTC 540 
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AAGAACTTCT GGCAGGAGAT GTACTTCCTT TGcTGcTTCG CGAATGAGAG CACGTTTATC 600 

ATTTTCATCA TAAATGCTAA AGTTCTTCCT CCAGCCGAGA ACATGGATAT GTTCTCGCAA 660 

AATCGTAACG CCAAAAGCGT GAAAAGTACT GACGGTCGTG TTGCGTAAAG GTTTGCCCGT 720 

GAGAGCTTTA ATACGTTCAG ACATCTCGTG TGCGGCCTTG TTAGTGAAGG TCAGTGCAAG 780 

AATGCGCGAT TGCAGAATAC CGCATTCGAG CATATACGCG ATGCGCGCCG TGATTACGCG 840 

CGTTTTCCCT GAGCCTGCGC CGGCGATAAT GAGAAGCGGT CCCTCaAGTG TGGTAACGGc 900 

TCGACGTTGC TCAGGATTAA GTGTAGAGAG CATAGACGCA CAAGCGTAgm AnAATAGGGC 960 

GGAAAAAAGA ATATCCGTAT GTGGAAGGGc ATAGACTGTG GGTAATACTT CAGATTGGGT 1020 

AGAGATGGAT GGATTGCGCT ATGTGTATGC CGCGCAnGGT GCGGCCCCCA TGCCTGCTCC 1080 

TACGGATAAT CCTGCTTGTG ATGCGCACAT GTCGCATGAC GTCATAGCGC GTACTGCCCA 1140 

AGCAGTTTTT GGTATTCGTG CGCTGTTTCC TTGGCAGCGC TTGGTAATTG CTAACATACT 1200 

GGATGCGGCG CATGCGTGTA CACATACAAC TCCGTTCGCT GCGGCAGGTT CTTCTCAAAC 1260 

CGATGCTACG AGGGTGACTC ATGTGGATGA CGCGCACCTG AGGATAATTT CGTCGGTGCC 1320 

ATGCAAGACA CACGTTTTGA TCAGGATGGC GTGTCACGCG CACATCAAGT GGTGCTATTG 13 80 

CCGACAGgTG CAGGAAAATC GCTGTGTTTT CAAGTACCTG CCCTCTTTTT AGAGGGGCCG 1440 

ACGCTAgTGG TGTACCCACT GTTATCGCTC ATGCGTGATC AGTGCCGTCG GATGCAGGCA 1500 

GTGGGATTTT CGGTCATCTT GTTACGTGGT GGACTGAATG CGCAAGAGCG CGCGTACATG 1560 

TATGCGCAtT GGATAgtGTG CTGAgGCGTA TGGCCGGATG CGAGgCGTTA cGCcTCCtGc 1620 

ACACCAGACG GCAGAATTTT CCCTTGCAGA TTCGATCTCT TTTGATGCGT CACTCTTCTC 1680 

TGATGACGTG AGTACCTTCT CGAAGGTGGT ACATGTGGAT GAACGTCTTG CTCAGAAAAG 1740 

GACAGAAAGT CGAAGAGGTG TATGCATCAT CGCAAGTCCA GAGATACTCA CACAAcCgck 1800 

CTGCGCCACG CGTGCGTGCA TGTCGCGTTG CGCATTTGGT TATTGATGAA GCGCACTGTG 1860 

TGTCCGAGTG GGGAGATTCG TTTCGTCCTG ATTACGTGCG ACTAGGCGAA TTGGTGCAGG 1920 

ATCTTGCGCC TCAAGTGGTG ACTGCATTTA CGGCGACTGC AAGTCAAACA GTGCTTGCGC 1980 

GCATCATGGA AGTGCTGTTT GGCGGTCGTG CGCACGTGTT GCAGGGAACA GTAGATCGCC 2040 

CGAACATTCG ATACACCGTA CGCACGGTGC TGTGTAAGCA GACGGCACTG ACTCAACTTG 2100 

TAGCGCGTTG TGTGCGCCCT GCAGTTATTT TTTGTGCTCG TCGGGTACAG GTGGAGCGTG 2160 

TAGCCCACCA TTTGCGCACG TGTCTTTCTG ACACACAGAT ACGTTTTTAT CACGCAGGtT 2220 

GCAGAGGGAA GAAAAAGAAA CAGTGGAGCG ATGGTTTCAT ACCCATGATT CTGCCGTTTT 2280 
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GGTAACTACT TGCGCGTGGG GAATGGGAGT TGATAAGCCG AATGTACGTA CGGTCATTCA 
CGTGGATGCG C C AC TGACTG TGGAGGCGTA CGTACAGGAG GTTGGAAGAG CAGGACGGGA 
CGGAATGCGT GCAGACGCAT TTTTATTGTG GTCACCTCGA GATGCTCGCT CGATAGAAAC 
ACTGCCGCAT GCACAACGGG TGCGTGCGCA CGTGTTGCGC CACTTTGCTG AAAGCGGACG 
TTGTCGCCGC GCAGTTTTAC TTGAGTCTTT GGGGGAACAG AATGTGTGTG CCGGATGTGA 
TGTGTGTGCA GGCACTGCAC GTTTTGTATG TGAGGATGTA GAAGCGCTCT TACAGTTTTT 
GAAAAAGAAT GCGCGCAGAT TCACTGTATC ATCGTTGGTG CAGCACCTCG CGCTACATCA 
GAAAGTGCTC AGTGTGGCGG ATGTACGTGC CTTGCTATAT TACGCGCTCG AAACAGGACG 
TGTGAAAAAA AAACATTCAC TCTTGTGGGG TGATGTCCTG TATGTTGCAC GTTAACGATT 
CTGCGAGCAA ATCGTATCTG CAGGAAAGCA AGAAGGATGG CGAGAACATA CAGTTGCCTT 
GTATATTCCG CGAGTTACGC GCATTTTTAT GGCCGAGTGG TTAGCATTAC TTCTACAGGT 
TTTTTATGCA ATTTTATATA CTGAGCCGTC GTTCATGTGT CTCCGATACG GTGTGGTCTA 
GGTTCCGTAC gTGCGGGCAC GGAACACATC GAGCGGACGC GTCTGTTCGT GGAGGATATT 
ATGAAAAGGT TTATTCCCCA TCGGGTGATT CACGCGGTGT GTATCGGGCT TGCACTTGTA 
GGTTGTAGGA AACTCGATTC TCGTGCGGGG GATTTTGAGT TAACGATTAT ACATATCAAC 
GATCATCATT CGCATTTGGA ACCAGAACCC TTAGAGCTTG CAGTGGCAGG GGAAAGACTC 
AGAGCGGCTG TAGGCGGTTA TGCGGCGCTT GTGCACGAGA TACAACGGTT GCGTGCGGAG 
TCGAAGAACG CATTGGTACT GCATGCAGGA GATGCACTCA TAGGTACGCT GTATTCTACC 
CTCTTTAGAG GGCGTGCGGA CGCGGTGCTG ATGAACCATG CAGGATTTGA TTTTTTTACC 
CTTGGCAATC ACGAATTTGA TAATGGGAAT GAGGGACTCA AAGAATTTCT GCACTATTTG 
GAAGTGCCAG TTCTCTCTGC AAATGTGGTT CCTAATGCTG CCAGCACGTT GCATGGCTTG 
TGGAAGCCGA GCGCTATTGT GGAGCGTGCA GGTGAGCGTA TTGGGGTTAT CGGACTTGAT 
ACGGTAAAGA AAACCGTGGA GTCATCCAGT CCCGGTAAGG ATATCAATTT TATTGATGAG 
ATAGAGGCGG TGCGTCGTGC AACTGTTGAA ATGCAGCaGC AAGGAGTAAA TAAAATAATC 
CTCCTTTCTC ATGCAGGTTT TGAGAAGAAC TGTGAAATTG CTCAGAACAT TTCTGGTATT 
GACGTCATCG TGTCAGGTGA TACCCACTAC CTTTTGGGGG ATGAATCACT CGGACGGCTA 
GGTCTTCCGG TAGTTGGTGA ATATCCCAGA AAGATTATGT CCCCTGCAGG GGAGCCTGTG 
TATGTGGTAG AGGCGTGGGA GTATGGTAAG TGTCTGGGCG AGCTGAACGT AGTCTTTGAC 
CGAACAGGAG TAATAACGAG TGCAGTAGGC ATGCCGCGTT TTTTGTTACA TACGAATACA 
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TTGCAAAAAA AAGGAGCGGA TAGAAAAAAT TATCCTCTTG AGGAGGCAGA GCGTGAAGCG 
CTGCTTGTGG CACTGAGGAT GACGCCAGAG ATTATATTTG CGCAGGAGAA TGATCAGATT 
ATATCTGTGT TGGAAGAATT TAAAAAGGAA AAGGAGGCGC TTGGTGCGCA GGCAATTGGC 
GTAATTACCG GTGCCTCAAT GCGAGGkGGn TCTGTGCATC GAGTTCCCGA TGCACAGAmT 
CCACAGGGTT CGGTTGCAAC GCGGTTTGTA GCAGAGACGA TGCTCTCAGA CATTCAAAGT 
TTTGGTGCGG GGAAGgTAGA TTGCGTAATT CAAAATGCAG GCGGTGCGCG GTCAAATATT 
CAGCCTGGTG AGATTACGTA TAATGACGCA TACACGCTCC TCCCCTTTAG TAACACGCTG 
GTGTTGGTGG ACGTCAGCGG TGCAGAGTTG AAACAAATTA TAGAGGATGC ATTGCAGTTT 
GCACTTGGTG ATGGTTCCAC GGGAGCCTTC CCCTATGGGG CGGGTGTCCG GTATGAAGCG 
CGCCAAGAAC CAGATGAACA TGGCAAACGA GTGATAAAGC TTGAGGTGCA AAAAAAAGAT 
GGAGCGTGGG TGCCAGTAGA TGAGCGCGCG CCGTATCGGT TGGGTGTGAA CTCGTACATT 
GCGCGGGGAA AAGACGGATA TAAAACGCTC GGAGAGATTG TCAGTACGCG CGGAcTGAGG 
ATACGTATCT GCGTGATGCG GAGTCTTTGA TTAAGTTTTT GCGTGCGCAT AAAAATTTTC 
GTGCATACAC AGATTCCAAT GTGATATTCC GTCTTAAATA GTAGGAAGTA ACTTACATTA 
GAGGcCTGTA AAGAACTACG TTCTTTACAG GCTGTGCCAA TCTGCTTTTC CGGGAAAGAC 
AAAGGGTATG CCACGTTAGG AGCGGAAAGA AGGGTGCTGC ACATAACCTT ATCTTTGCGA 
TTGACCGTGG TATACTCCTT GCACCTTATG CAAGAGAAAA AAACGCTTTA CCTTCTTGAT 
GCCTACGGAC TTATTTATCG GAGTTACCaC GCGTTCGCGC GTGCGCCGTT GATTAACGAC 
AGCGGTGCGA ATGTTTCTGC CGTATATGGT TTTTTTCGGA GTTTGCACAC GCTCCTGTGT 
CACTATCGAC CCCGTTATTT TGTTGCTGTT TTTGATTCTC TCACGCCTAC CTTTCGGCAC 
GTACAGTACC CAGCCTATAA GGCAAAAAGG GATAAGACTT CTGCAGAGCT TTATGCGCAA 
ATTCCCCTTA TCGAAGAAAT CCTGTGTGCA CTGGGCATTA CAGTTTTGCG TCATGACGGC 
TTTGAAGCTG ACGACCTCAT TGCAACCCTA GCAAAACGAG TTGCGGCTGA GCACTGTCAT 
GTTGTGATTA TCTCCTCAGA TAAAGATGTA CTTCAGCTTG TGTGTGATAC GGTGCAAGTG 
CTCAGACTTG ACATAGATCA TAAGTGGACA TGTTGCGACG CTGCGTACGT ACAGCAACGG 
TGGACGGTCA TGCCAACACA ATTACTTGAT TTGTTCTCTC TCATGGGAGA TTCCTCCGAC 
AATGTGCCTG GTGTGAGAGG GATTGGTCCT AAGACGGCTG CACATCTTCT CCAC TGTTTT 
GGCACACTTG ATGGTATTTA TCGTCATACC TATTCCTTAA AAGAAaGCGc TGCGCACGAA 
GATAGTGTGT GGGAAGAAAG ATGCATTTTT TTCTCGTTCA CTCATTGAGT TGCGTGACGA 
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TGTACCATGT GTTTTTTCGC TCGAAGATTC CTGTTGTATT CCGCTCGATG TAACGTCTGC 
TGCACGTATT TTTGTGCGAG AAGGATTGCA TGCGCTTGCA CAACAATATC GTGCTTGTGT 
GCAAGAAATA GATACAGAAG CAACAAACGA TACATTACAA ATGACAGAGT CTTCTGTGCT 
CACGTCTGGT CGATGTGCAA ATGAGTGTTT CTTATCTCAG GTAGAAGGGA GGGCTAGTAC 
ACCGGAGGTG AaCTCCGTAT TGAAGTCGGA GTTGAAGACG AGTGCTGTGT CTGGCGCCAT 
ACCTATAGAA AaTAGAGATC TTAGGCAGGA TGTTATGCTT gCACGCAGTG CaGGTCATTA 
TCGTGGTGTT ACTGACCCTG TAG AAC TT AA AC GT ATT ATT GATTGCGCGT GTGCGAATGG 
TGTGGTCGCG TTTGATTGTG AAACGGATGG ATTGCATCCG CACGATACAC GTCTGGTCGG 
ATTTTCGATC TGCTTTCAGG AAGC AG AGGC TTTTTATGTT CCTCTTATTG TTCCGGACGT 
TTCTCTTCAT ACCGAGTCAA CTCAGTGTAC ATGTGCACGT AGCACTAATG TCGAGACTGA 
AAAGGAGTGC ACAGAACAGC ATGGGGTATC TGCATCTGCT GTGCAGGATC CGGCATATGT 
CCAAGCTGTC ATGCACCAGC TTCGACGTCT TTGGAATGAT GAGACGCTCA CACTTGTTAT 
GCATAATGGA AAGTTTGATT ATCACGTTAT GCATCGTGCA GGCGTTTTTG AGC AC TGTGC 
ATGTAATATT TTCGATACGA TGGTTGCAGC TTGGTTGCTG GATCCCGATC GCGGTACATA 
CGGTATGGAT GTACTTGCCG CATCATTCTT TCAGATCAGA ACGATTACAT TTGAAGAAGT 
GGTAGCAAAA GGGCAAACCT TTGCGCACGT CCCTTATGAG TGTGcAGTCC GCTATGCAGC 
GGAGGATGCA GATATTACTT TTCGTTTATA CCATTATTTA AAACTCCGCT TGGAAACAGC 
AGGATTGCTT TCTGTGTTTG AGACCATAGA AATGCCGCTT TTGCCTATCC TAGCACGTAT 
GGAAGAAGTG GGGATTTTTT TACGTAAGGA TGTTGTGCAG CAGCTCACTC GATCTTTTTC 
AGATTTGATC CAGCAGTACG AGCACGATAT TTTTTCTCTT GCCGGTCATG AATTTAATAT 
TGGTTCTCCG AAGCAACTGC AGACAGTCCT TTTTCAAGAA TTACATTTAC CGCCCGGTAA 
AAAGAATACT CAAGGTTATT CTACTGATCA TTCTGTATTG AAGAAACTTG CACGTAAGCA 
TCCCATTGCA GAAAAAATAT TGCTCTTTAG AGATCTTTCA AAGTTACGTT CGACGTATAC 
CGAATCGCTT GCAAAACTTG CTGATCAAAC AGGGCGTGTA CAT ACT AGC T TTGTGCAAAT 
TGGTACCGCA ACTGGAAGGC TTTCGAGTAG AAATCCAAAT TTACAAAACA TTCCCATTAA 
AAGCACAGAA GGAAGAAAAA TAAGGCAGGC GTTTCAAGCT ACTGTTGGGC ATGAGTTAAT 
TTCGGCAGAC TATACACAAA TAGAGCTGGT CGTGTTGGCC CATCTATCTC AAGATAGAAA 
TCTTCTCAAT GCATTTCGAC AGCACATTGA TATTCATGCA TTGACTGCTG CATATATTTT 
CAATGTGTCT ATAGACGATG TACAACCTGC AaTGAGAAGA ATCGCAAAAA CT ATT AAC TT 
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TGGAATCGTG TATGGAATGA GCGCTTTTAG ATTGAG TGAC GAACTTAAAA TTTCTCAGAA 
GGAAGCGCAG AGCTTCATTT ACCGTTATTT TGAAACGTAC CCGGGGGTGT ATGC TTTTAG 
TACACAGGTT GCAGAGCAGA CACGTAAAAC CGGCTATGTG ACTAGCTTGG C TGGAAG AC G 
ACGC TACATC CGTACTATCG ATAGTCGCAA TACGCTTGAG CGCGCGCGTG CCGAACGTAT 
GGCGTTGAAT ACTCAAATTC AGAGTTCTGC GGCGGATATT GTGAAAATTG CCATGATAGC 
AATC CAGCGT GCGTTTGCGC GCCGACCGTT ACGTGCACAA TTGTTGCTGC AGGTACACGA 
TGAATTGATT TTTGAGGCGC CAGCTGCTGA GACAGCGATA GTGAAAGAAA TTCTCTTTGC 
TGAGATGGAA CATGCTGTTG AGCTCTCGAT CCCGCTGCGT ATACACGTGG AGTCTGGAAA 
TAG TTGGGGT GATTTTCATT AGCATACCCA TCTGAGGGAT GCAACAGGGC ACGTTATGAG 
GTTACCTCGG CGCGTAGTTC CTTAAAAAAT GATGCTACCA CGCACAACAT AATCAGCGCT 
AAAGGAAATG CCGCAATGAT GGCTAAACTT TTCAGGTGCA TGAGTGTGGA CTGGGAGAAT 
ATGAGAGAAG CGGGAAGGAG AATGC ACGC A ACCGCCCAAA ACGATTTCAT TATTTGACGT 
GGTTCTTCTC CCGGTGCAAC GCTTTTTTGC GAATAGGAAG CGATGATGAG CGTTAATGCG 
TCAAAAGTAC TTGCATAAAA GGCGATCATG GTAGCTGCCA ACAGCGCCAT AACGATGTAC 
GCGCAGGcAG TGTCTGAATA ATTGCGATAA TCACCTCAGC GGGTGTATTC CCCGCGCGCA 
nAAGGTACGC GGCAGGAAGG AGGTGGTGCG TTTGTAAATA GAGCCCGTAG TTCCCTAAGA 
CGATGAAGGA GCCGTACGTA CCTGCGATAC CCCAGCAGAG CCCTCCGACG ATGGTATTCC 
GGATGGTTCT CCCTTCGGAT ATCGCGCCGA TGAAGAATGG GGTTGCAACA GACCACGTGA 
TCCAATACGC CCAGTAAAAG ATAGTCCACC GCTGTGGAAA TCCAAGCGTC CCATCCGTTT 
CCTGTAATGA AATACGAGAA GGATCCATCC ACGTTGCCAT AAGAAAGAAG TTTTGTAGCA 
TTTTCCCTAT CGCGGTGATA CCCGTCTCGA TAAGATACAC GGTTGGTCCT GCACACAAGA 
AGAAAACAAG AACGGTACTA AAGCAGTACA CCGCGGCACG C g AGAGCTTT GAGATCCCCT 
GGGTACCCAG CAGTACTGCT GTGGTGTACA CCAACGCAAT AACGCAGAGC AACGCTAAAG 
CGAGCAGCTG GGTGTTAGAA ATACCGAACA AGAGAGAAAC CATGAGCGAA AGGAGGGGCG 
TTGCTAAGGA AAACGTGGTT GCAACGCCTA GAAGCAATCC CACTACAGAA CAGATATCGA 
TTGCTTCTCC TATTATTCCG TCTACGTACG CGCCCAGCAG CGGACGACAG GCTTCAGAAA 
TTTTGTGTGT GTGTCGCTTC TTTACATGCA ACATGTAGCC GAAGcaCTGC GAGAAGAACG 
TAAAAAGACC AAGGTATGAT GCCCCAATGG AAAAGGGGAT ACGCTGCCGC CCATTCTTGT 
CTTTCCGTGG GAGGGGAGTG TTCTGCTATA AACGGAGCTT GAGTGAAGTA GTGCGCCCAT 
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TCGATGAGCG ACCAATACAA AATATCCGCC GCCATGGTGG ACGTAAAAAT CATGGAGCCC 
CACGTAAAGT TGGAATAGCG TGCGGTGCGC GTAGTACCCA GATACACCGC ACCATACCGT 
GAAAAAGCAA TAGTCAGCGT CGTTCCTAAA AAAAACAGTC CGGTAAGGAT ATAGAAAAAG 
CCCAGTTTGT TTACCAGTAT ATTCAAGAGG GTACCGATTA CCCGATGAGA AATGTCAGGA 
AAGGAAATAA ATAGGAGTGC GCACGAGATC ACGATTCCCA ATGGGATGAG AGAGACGCTG 
AAGTCACATT TTTCTTTCTG CATGATCTGC CCTATGTGTT TTCAGGAGCG TCTGAACTGC 
TTCGTAATAT TCTTTTGCAT ATCGGTACAT TTTCAGTCCA TAATCTCCAA ACGAGATACC 
AAGCGCCTGT TTGTAGCTCG TCCACAAAGA CCACAAAAAT CCCCCAAGAG CAATATAGCA 
AAAGACGCGC AGGCGCTCTT CTCCGTGCGG ATTACGCTGA AAATAACGAT ACATGAGATC 
TTCTATCTGG GCGGTGTTAA AGTGGGCATA CAAAGAGAAC ATCGCGATAT CCACAAGCGG 
ATCACACATG CCCGCATATT CCCAATCGAT GAGCTGGGCA CTGCCATCGC AAAGCAGGAA 
GTTATCTGGG GTTAGATCGA CATGCGTAAG TACGCACGGT TTGTCCACTG AATCGACAAT 
AGCCAAGAGT GTATTCATTT TTTCTCTCAC TGAACGATAG TCAGCGTAAA GAAT AC TGT A 
CTGGTCGAGG GCAAGCTTTT CGTAATACGC AATACGCGAT CTGAAGTCGA ACCTATGGGC 
AACACAAATA CCTGATTGAT GGAGTTTGCG CGCAATCCTC ATGCACAAGT CAAGGTCTGC 
CGGATCACTA GGGTTTGCAC TGCGGCAGTC TTTGTGGAAG ACGGTGATTT TAATACCACG 
GGCAGGCTCA AGGTGAACGA GAGTGTCACA AATATTAAGG GGCTTAATTG CTTCATACAC 
GGCAGCTTCT TGGAAGCGAT TGACAAGGAG TTCGGTACCC TCTCCAGGAA TTCTGAATAG 
GTACGGCTTA TCATTCAGCT CAAAGATAAA AGATTTGTTC GTCATTCCTG CTTTAAGTGG 
ACGCAGCTTA CCGATTGTCA TTTCGGGTTG GTTGAAAACA TGGGAAATGA CGCGCATCCA 
CTCGTTATCG CTCGAAACCA TGTACGCAAA ATCAAACTTG CGCAGcTCAT CGAGCGACTC 
AAATTCGTAG ATAAGGTTGT CAGCCTGTTT GTTGGCAAAC ATGGTAAGGG ATTTTGTATT 
GCGCATAAAG ACGTCTTCCC AGAACCACGC GCGGTTTTCT GGGCGCATGT AGGCAGCTTC 
GATGAGGGGG ATGATCTTTT TAGAAAACGA TGCAGAAAAG TAAGCTGGAC CGTACATGAT 
CCACCCACTT CTGCCACCAA TTTTCACGGC GGTAATTTTG TCGTACAATC CGGTTTTTAA 
CACCCATTCC TTTGTTTTCC CTTGTGTCTT GACAGCCGTA TACCAGGAAT CCCACTCATG 
GGAGTGATAA ATGTTTTCGC GTAGCcAGTT ATCGCTTGAG AGTATGTAGG TATTTCGAAG 
TAAATGACGG GCGTGGTAGA GGGTGGATAG GTTGTTGGCA GTTTCATAGT CAGGGTTATA 
GGTGAGCGTC ACGTCATGCT TGTCGACGAG GTACTCAAAG GCCTCTTTGA GGTAGCCgAC 
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GACGACCGTG ATGTCAGTAA TGCCGACTTC GTGTAACTGA CGAATTTGAC GCTCAATCAT 11040 

GGGCTCACCA AAGACTTTCA GCAGGCCCTT TGGAGTAGCG TATGTAAGAG GCACGCAACG 11100 

GGAGCCAAAG CCTGCAGCCA TTATTACCGC ATTATGAACC CGGCAAGACT GATACTGACT 11160 

ATGTGCAAGG GGGGTGAGAG AaCGCGTCCG CGCACCAAAG GTCCCGTGGG TTTCAAGGAG 11220 

TCCTGCAGAC TCCATAGCGT GTAGAAGGCG ATTGGTGTAA GCCAAGGACA GGCGCAACGC 11280 

CTTGGCGATG TCACGCTGGC AAAGCCGAgG CGCATCGCGC AAAAGCTGAA AAATCTGGAA 11340 

AAAACGACGC CCCACACTGA GGAAGTGTAC GGGCGTGTTC GOCCGCCGTC AAGCACGCAA 11400 

AAAACAGGGG AAAAAGCACC CCCCCCCCTG CTCGCTTCCT GCACACAGCT GTGAGGAGCG 11460 

CATCCTTCGC TC T CC T AG AT AATATTTTTC ATCACAATCG GCTGTCCTGT TGCATCAAGG 11520 

GCGTCGCGCC ACAACGATCC TTCAGGATTT ACGTGTTTGC GTTGGCACAC CACCACATCG 11580 

ATTGGGAGGT GTACAAACTT ATTGTGCACC AAGCCAATGA TCATTTTCGT TTTACCGCAC 11640 

ATCGCCGCAT GCACCGCATT GTTACCGAGG CGTTCGCAGT AAATCGAATC TATGGGCGCA 11700 

gcAACCGCGG aACGAATCAA GTAGCTCGGA TCGATGTACT TTAAATTGAT GTGTATACGC 11760 

TTTTCTTTGA AATAGACTCC AATCTTTTCT TTTAAGAACA AACCGATATC CGCAAGGCGC 11820 

TTGTTACCCG ACGCATCCGT GCCGCTGcTC ACGCGCAAAC TACCTCCTTG GGAATCACCG 11880 

GAGGGCACAC CGTCCGCATT TACCATTAGG TCTTGCCCCG CACCTTCTGC TACAACGAGC 11940 

ACCGCATGCT TACGTAGCGC GATTCGCTTC TCTAGGTGAG CCAAAAGCCC ATTTGGACCG 12000 

TCAAGGTCAA AGCTCACTTC AGGGATGAGT ACGAAgTTTG TCTCATGGCT CGCAATCGCC 12060 

GTGTACGTAG CGATGAATCC AGACTCACGC CCCATGAGTT TGACCAGTCC AATGCCGTTA 12120 

ATCTGTGAAC GAGCCTCCAT GTGCGCTGCG GCAACTGCCT CTGTTGCTTT GACAATAGCA 12180 

GTATCGAAGC CAAATGACTT TTGAACAAAA GAGATGTCGT TGTCCAcCGT TTTGGGAATG 12240 

CCGATGATGG AAATCtTAAg GTTGCGGtGT TTTATTTCGT CGGCAATCTC TTTTGCTCCC 12300 

TTCTGACTCC CATCCCCCCC AATGaTAAAG AGAATGTGCA gGTTGAGCCG CTCGATACCA 123 60 

TCGAAGATGT CAAnCAnAAG GTTCCCCCCA ACGCGnGAnG TGCCTAAGCA G 12411 
(2) INFORMATION FOR SEQ ID NO: 90: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 971 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
<D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90: 

ACCACCGCAA CAATCGGGAT ACCAACTCGC CTCGCTTCAC GGATAGCGAT AGTCTCCTTA 60 

CGCGTATCAA TGATAAACAC TACACCCGGC AGCTCCTTCA TTTCCTTTAT GCCGCCCAAA 120 

TTCTTTTCTA GTTCGCGTGC TCCTTGCGTA AAGACGCCAC CTCCTTCTTA GAGAGATGCT 180 

CGAAmGTACC GTCTATCTCC ATGCGTTCTA TCTTCTTGAG AC GC GAAAG A CTCTTCCTTA 240 

TGGTGGAAAA GTTAGTGAGC ATGCCGCCGA GC C AACGGTT AG TCAC AT AA AACATCCCAC 300 

nAGCgcTGCG CTTCCTTGGC AATGGTTTGC TGCGACTGCT TCTTTGTGCC CACAAACAAA 360 

ACGGACTTGC CTGAGGAAAC AGTCCTGCGC ACCATGTCGT ACGCCTCGCG GATGGCCGTA 420 

ATCGTCTTTT GCAGATCAAT GATGTGAATG CCGTtACGCT CCGCGAAAAT ATACTTTTTC 480 

ATCCGCGGAT CCCACCGCCT GACCTGATGG CCAAAATGAA CCCCAGATTC AAGCAGATTT 540 

TTGATAGTCA CCACTGCCAT ACCATTCCTC CACAGAGAGC GCCGAGGCTC TCATCCTTCT 600 

CTCTTTCACC ACACGCACAC GCGCCGGGGA TATTCTCGAG AGGACAGAAA AACGCGTCCC 660 

CTCACACACG GTCACCTTGA CTAAAATCAG ACGTATGGAA TACGGTTGCC CGCCTCTTCT 720 

GTGGACCTTC CCAGGCCCGC AG C AGGTT AC AGAGCAGGGA AGATTAGCTC ATTCGGATAG 780 

AGCGTTGGCC TCCGGAGCCA AAGGC GnTGG GTTCAAATCC CGCATCTTCC ATCCTCTTAC 840 

CCACGTAAAC GGGCGCCCCG CGCmTrACGC TCCTTCCAGT CAAgCTGGCA ACTCAGGGGc 900 

CGTCCGCCCG CTCTCCTTCA GCAGGTCACT TGC CGCC AG A AACGACTCCC TTCAnCACTT 960 

CGAGCGCGCG A 971 



(2) INFORMATION FOR SEQ ID NO: 91: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1985 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91: 

AGTGCCGACG CGCGAGAACG TATTTTTGCC CATC TTG AAA TGCGTGGCGC AGGTGCACGG 60 

CGCGTCACCT ATCGCCTACG CGACTGGGTG TTCAGCCGTC AGCGCTATTG GGGAGAACCC 120 

ATCCCTCTTG TGCACTGTCC TTCcTGCGGT GTTGTACCTC TCCCTGAGAG TtGCCTGCCG 180 

CTTTTGTTAC CCGAAACCGC CGATTTCACT CCCACGGAAG ATGGGCAGGG CCCCCTTGCA 240 

CGAGCGCGCA CGTggTGCGC GTTCCCTGTC CGCAGTGTGC ATCTGACGCA GTGCGAGAAA 300 
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CAAACACAAT GCCCCAGTGG GCAGGATCCT GCTGGTATTA CCTCCGTTAT ATGGACCCCC 
GCAATAAGAC TGCCTTTTGT GCACCCGAGA AGGAGCG TTA CTGGGCgCCA GTGGCGTTAT 
ATGTAGGTGG TGCAGAGCAC GCCGTACTGC ATTTACTGTA TGCACGCTTT TGGCACAAGG 
TATTGTACGA CTTAGGTCTT GTAAGCACGA AAGAGCCCTT TGCGCGGTTG GTGAACCAAG 
GCATGATTAC GTCGTATGCA TATCGCAGGA AAAATGGCGC GCTTGTACCT CACGACGAGG 
TGCACACTAA TGCTCAAGGT ACCTACGTGC ATGCTCGTAC GGGGGAAAAA CTCGAGTGCG 
TTGTGGCAAA AATGTCAAAA GCGTTAAAGA ATGTCGTCAA TCCTGATGAC ATGATTGCAG 
CGTACGGTGC tGACGCGTGC CGGGTATACG AGATGTTCAT GGGACCTCTT GAGGCTTCCA 
AACCGTGGAA TACGCAGGGG TTAGTGGGGG TTTTTCGGTT TTTAGAAAAA ATTTGGGTAC 
TTGCGGGGCG CGTGGCGGCC GCAAACGGTA TTC CACAAG A CTCTCGTGCA GAGCCGCCAG 
GTGACCTGCA CGCACAGAAA AAGTCTTGCA GCATGTACGC CCTCGAAACG CTGTTACACC 
GGACTATTCA AAAAGTGaCg ACGATACGTC GGCGCTTAGT TTTAACACGG CAATCAGTCA 
GATGATGATA TTTGTAAATG AwGgTACGCG GGTGGCGCGG AGGATGCCTC TTCCCTCTAA 
AATGTGGGAG ATGTTTGTAA AAATCCTCTC TCCCTATGCA CCACATTTGG CAGAAGAACT 
CTGGGAAATG TGTGGGCACA CGCACACTAT CGCATATGAG CCTTGGCCAC AGGTGGACCC 
TGCGCGTGTG GCGCCGCATG TGTGCTCCGT AGTGGTGCAG GTGAACGGTA AAGTGCGCGA 
CACCTTCTCC GTAGCGCCGA ACGCTCCAAA TGAGGAACTC GAGCAAAAGG CGCGGGAAAC 
CGCCGGTGCG CGTAAGTTTC TTGGTACGCA GCAGCCAAAG CGCGTAGTGA TAGTTCCCAA 
TAAATTAGTA AATTTCGTTC TGTAGTCCGC ACTGCTCCTG CAGCGTTGTG CAGTACCTGT 
GTGGTGCCCT ACCCGCGTGC ACTACAATCG CGTAAAGGCA CAGCTGCATC AGCGGCCCTT 
GGAGAGGCGT CCCACCTGAG CGGATCATTG CGTCTGTCTG GAACAATCTA TCCAAAATAG 
CATGCGTATC GCTCAGGGTC CAAAGAGACG CAGCGCGCTG GTATTGCGCA ATCATACGCC 
GAGAAGTGAA CCCATAGCGC TTTAGCGTTG CAGGGTGCGT TTTGTCCTCC GGTATATGGT 
GCCAATGCGC AAGGCGACGG AACGCGTAAG nAGtCCTGCA AGGATTTGTA CAGGGGCAAC 
GTCTTTCGAA CAAAGCAGCG TGTTGAGAAT CATAAGTGAA TGCTCAAGAT CTCGTTTGGA 
CAGCGCATCG AATAAGGTAA ACGATGTCTC TTCTTTTGTG TGCACCAGCA ACGAACTAAT 
GTCGTGCGCA gTGATGCGGc GTCCTTTTTc AAAAAAAAGA GAAAGCTGCG TACAAACAGT 
TTTGAGCGCA CGAGTGTTGT TCTCCACCAA CTCAAGAAGA GATTCGATAG CCTCCCTGTC 
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(2) INFORMATION FOR SEQ ID NO: 92: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1043 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92: 

AGGGAGTGGC TTATGGGATA CGGTTCATCA GCTGAGGAAA CGTCTTCTAC ACCGCATGCG 60 

TCAGGCCAAA GGAAGGTTGG TTTTCTGTCC CTGCGCACCA AGCTCGCGCT GGTGTTTGGG 120 

CTGCTTGCGT TCGTGTCTGG TCTGGTCCAG GGCGGTATAT TGGTCGTTTT TGCGCGTAAC 180 

TCCATAGTTG GGGAGATTTC TAGTCACCTT GCTGGCCGTG CTCGGGATAC CTCCTCCATC 240 

GTGGAAGGGC GGATCGGCGC GCTGTTCCAG TTTTTGGAAG GTTTGGCACG TCTTGAAGTT 300 

TTGCAAGGCT CGTCCGACAG GCGCCGTGCC CAGGTGGACA GGCTAAAGAA GGAAGCGTTT 3 60 

TTTAACCGGG ATATCGCGCG TCTTGCGGTG GTAGATCTCG cAGGCGTGTT GTACGGGGAG 420 

GACGGGCGCA CGCATTACGT ACAAGATCGA AAGTACTTTC AGCGGGCGGT TAAAGGCCGT 480 

TGTTACGTCT CTGCGCCCTA TCCCTCGCGT TCGTCGGATG ACATGGTCAT TACCTTTTCC 540 

ATCCCGGTAT ATGACGAAaT CGGCGGGTTA TgCCGTGCTC GTAGCGGaTG TGATTTGGAC 600 

GTGGCTGTGT GATATCACAG GGGATTTTTC TGTAgGgGGG TGGGGAGAAT CGCCGTTATT 660 

GACGAGGTTG GTACCGTTGT CGCGCACCCA CGTCACGAGG TAGTGGCGCA CAGACAAATT 720 

ATATCCGCCT GGCAAAGGAA GACCCGGCCA CGTACGCGTC CGTCGCAGAG TTCGTTGAGA 780 

AGGTTATCAA GTCAGACTCT ACTGCCTCTC ACGTGTTCTC GTATGAAGGC TTAGAGAAAA 840 

TCGGTTCATC TGCCAAGATG AAGAGCACAG GATGGACCGT CGTGGTGTTT GTGCCTGTCT 900 

CCGAGTTTAT GGGGCCTGTG TACACCCTGG CAGAAC TACC TGCTTGCGGT GGGTATCATG 960 

TGGTACTCTT CTCCCTCATG TGGTGTATGC CGTTGCGCGC AAGATTGTGC GCCCGCTACG 1020 

CTCTACCGTC AGGTGTTAGA AGA 1043 



(2) INFORMATION FOR SEQ ID NO: 93: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1357 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93: 

CnTCTnTTAT TTATTTTGAA AAAGACACGT TGCTTTTCCT GCGCGAATGC GTCAGAATGG 60 

CGCGGTCCCT ATGATATCGT GTTCGGTGCG CAGGAGGCCG AGATGGGAGC CTCAGGTCGG 120 

CGCTGCGTTT CTTGCCTTTG CGCTCTTACC GGTCCTGGCG AGCGGACGTG GTATGCAGGC 180 

GGCAGTGGCC ACAGCCGCAG GGTCCAGTGG TTCCGGCAGT GATGGCAAGC ACCCCGGCAA 240 

GGAACAGTTT CTCCAGTTCC TCATTCCATC TGGCGGTCGC TACGAATACC TCGGGGTGAG 300 

CTTTACAGCG CTGGCAGATG ACGCCAGCTT CTTTGAAGCT AACCCTGnCG GCAGCGCCGG 360 

GCTCAGCCGC GGGGAAGTTG CTCTGTTCCA CCACTCGCAG ATCCATGACT CACACACCGA 420 

AACGGTTTCG TTTGCGCGAC GTACGCAGAA CACCGGCTAC GcGCCTCCGT GCGCGCCTTC 480 

TCTTCTGAGT CAGATCTCAA GTCCTTCTTC GGGGGCAACA GTGGTGGCAA TAAGAACGGC 540 

GGACACCAGG GCAAACAGGG AAAAGGCTTC GTGGCAATAG CCAATGCGTC TCACACCTTC 600 

TGTGGCCAGT ATCGCTTTAA GGGCGTAACT TTGGCTGCAA TTTCAAGATG GGATTCCGCA 660 

AGGtAAAACT GACAGCCACG TGACCGTCGC GGGTGACTTG GGCCTGCGCG CTGCCTTTTC 720 

TGTGGCAAAG AACTTTGGCT CAAATGAGCC GAACATGCAC GTGGGGTTGG TGCTCAAAAA 780 

TGC CGGGATC TCGGTAAAAA CAAACAGTTG CCAAGTCGAA CACCTCAATC CGGCCATTGC 840 

CGTCGGCTTT GCCTACCGGC CGGTGTATGC GTTTTTGTTC AGTCTCGGGC TGCAGCAAAC 900 

CCTCACCAAA AGGGAGTCGC CGGTGTGCAG TGTTGGGTTC ATGTTTTTTT GTACCCAACA 960 

CGTTACCCTC CTCGCCTCTG CTGCGTGTGA AGGAGGGGCC TACGCCCTCT CAGGCGGCGC 1020 

AGAAATCCGC ATTGGCTCCT TCCACCTCGA CATGGGGTAC CGGTACGACC AGATTTTCCA 1080 

AGCCGCCCAC CCACACCACG TGTCAGTAGG GCTGAAGTGG CTCATACCCA ACGGCGGCAC 1140 

CCAGGCGGAT CAGGCCCTCT TGGTCAAAGA GTCCTATCTA GTGGGGCTGC GCTTTTATGA 1200 

CCAGCGGCGC TACCAAGAAG CAATTACTGC GTGGCAGCTG ACGCTGCGCC AGGATCCGGG 12 60 

CTTTGAACCG GCTGCTGAAr GCATCGAgCG CGCACGACGC TTTTTAAAAC TACACGAAAA 1320 

ACTTTCTCTC TTTGATATTC TCAACTAGCC TGCCGTG 1357 
(2) INFORMATION FOR SEQ ID NO: 94: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2442 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 94: 
CACCGaCCCT GrAATaAAGC GTTACAAAAC CTACTCCTAC CATAAGGGCG ACAAGCAGTA 
TGAATACAAA ACTGTAACGC TCTTGATGTA CATTTTTCTC GATGGAAATC ATCGGTCTCA 
TCGCTCTTTC CTCAAAAGAG GTTCAAGCCG CTCGAGCGcA AcACTGCGTG AACCTTTCAC 
CAGAACAAAA TCACCCCGCC GTAATTGCGC GTCTAACGTT TCTTGCAGCG CAGACAGTTC 
CTCTAGTGCA AAGGCGTACA CGCGCTTCCT CCCCCAGGAA ACTTTTCTTA CCGCACGACA 
AAATTCGGGA CCAAACACAT ACACTGCACG CGCGTCAGAA GCGGCTGCCA AAACACACAC 
CCGATAGTGA GCTTCGGCAG CGGTAACTCC TAGCTCTCCC ATGTCACCGA G C ACAT AT AC 
TTTTGATACT GCAGAAATAT GCGCACACAA ATGAAGCGCC GCAGCCATTG AATCAGGATT 
CGCATTGTAA CAATCGAGCA AAAAGGTCAG CGACGCACAC ACCACATGAG AGCGGCCAAA 
GGGCGGTTTT ACCCGCTCCA TCCCCCGCTG AATTTCCTCA GCAGGAAGTC CTACCTGTGC 
GGCAAGCGCA ATCACTGCAA GCGCATTCTT TGCGTTATGC ACCCCAGGTA GTGGCACGCG 
AATCCATCGT CCTTGATATA AC ACGCGAG A ACCACGTAAA CCCTCATCTA TCACCTCAGT 
TGCTAGACCA CGCCCCCCCT GATCGTAAAC TACAACCCTA CCGTACGGGA TATTAGACAG 
GAATACAgAT ATGCATCGTC GGGGACAAAA CCCACGCTGT GTTCAGTAAA TTGAGAAAAA 
ATCTCTTTTT TCTCTTCCGC AATTGCCTGC TGCGTGCCCA GAATGC c T AC GTGCGCACAA 
CcTACGTTGG TAATGATCGC GTAATGAGGA ACAAGTATCT GAGCGAGCGT ACGCATCTCC 
CCCCGACGAT TCATCCCCAG CTCAAAGATT CCTACCTCAT GTTCTGCACG CACAAAAAAC 
AGCGACTGCG G t AAACCTA t CTCTGAATTT AAATTTCCTG GCGTCGCAAC CACCCGATAC 
CGTTCACTGA ACACCGCGCG AGC C ATTTCT TTTACGGTTG TCTTTCCGCT TGATCCGGTA 
ATGCCAATCC TAATAAGCGC AGGAAACTTT TTGCAGTAAA AGGAGGCAAG ATCTTGCAGC 
GCCCTGAGCG TGTCGTGCAC CGCAATACAG GCAGCTCCAA AGCGAGTGCA C C AAGCAAC A 
TATTCCCCAG CATGGGGGtA CCTTTGATCT ATAAGCGTTG CAACTGCGCC CTTCTGCAGC 
GCTTCTTCAA CAAACGTATG TCCATCTACG TGCGCACCAC GGAGCGGAAT AAACAAATCA 
CGCGGCACAA CCGCACGACT GTCAAAGGAA ACCCCGTCAA AACCGCGCGC CCCTCGTGCA 
TCGCACACGC GAGCCCCTTG CACTGCCGCA CATACCTCAT CAAAACTCAG AAGCATGAGG 
GGAGGAGTCC TTGTGCGCAG GACGCAGCTT AATGCGCACA ATTTCGGCGT GCGACGCCCG 
GTGTAAGTGC AGTTTCTCTA CCGCGAGCTT TTCAATACGA TCCGGCTTAG ACAGGATCGC 
GATCCCAGAA ATCTTACGCT TATTTTCTGC GATAATACGG TGTTGCTCTG CGTCGTACTC 
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ACGCACCACA CGCTCAACCG CCTGATAGCG CGAAGCCTGC CACACCCCCG CACACAGCAA 
CACGGGGATA CTCACCGTAA AGAACAACGC GCTCACCTTT TGCACCGTCA TCCCTCTTTG 
CCGGCGAGGT ATTCCCcTTC GCTGCTTTTC TCTCATCGCC CCTTTCTCAC GCATCAGCCC 
AGTCTCCTAC CGTTTCTCAA TGACGCGTAA CGTAGCACTC CgcGAAGCGG CATTCGCAGC 
ACGCTCCACA CACGAAGGTA CAAGCGGCTT CTTGGTAATC AGCGATGCAC GCGCCACACC 
GCCGCAGCTG CATATCGGCA CACGAGCCGG ACAGCTGCAG CGTTTCGCCC AGTGCCGAAA 
ATGTACCTTA ACAATCCTAT CCTCACGCGA GTGAAAACTA ATAACCGCAA GTCGACCCCC 
AGgCGCAAGC GCCGTAAACG CTGCCGTAAG AAGGCGTGGC AAACGCTCAA GCTCCCTGTT 
CAyCGCAATG CGCAATGCTT GAAACGCCTT GGTTGCCGGA TGGAGCTTCG GCAAAACCCC 
AAGAAC ACgC GCCGCTTCCC AAAmCGCGCC GTyCGCATCG GCGGCaCCAC GCGCGCAACG 
ACTTCTGCAA ATGCGCGCGC AGAGcAAAAG GGCGCCTGCC cGaaCTGCGc GCACACCGCC 
TGCGCAATCC GACGCGCGTA ACGTTC CTCT CCCCCTTCAA AAAACAATTG TGCCAAAnCG 
TCTGCGGCAG CCGATTCAGG AGGTCTGCAG CGGTCTGGGA GG 
(2) INFORMATION FOR SEQ ID NO: 95: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1921 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95: 
ATCCGGGATG CGCGGCCCCC TGGGGGAAAC GGAAATTGAA ATTCGCGCCG GGGCAGCCCG 
GGTGTGCCGC TCGCCgTGCG CGAACGGCAC kTGCATCGCG CACCCGCCGG TGCAGCGGGT 
GGGGGAGTGG AACGCCTGTC TACCGAACGG CGTCTTTCTG TACtGCACGG CACgACGCGG 
CTGAACC cGA AGCAGACGCC GTGCAGTAAC GCGCCGGcgC GCCCGCGCGC GTGCCGGCAG 
ACGGCGCTGG TGGGACAGTC CGCAGGCGGC GCCCCGCTAC CACGCGATTT TAAGGCCGCA 
CACAAAGGTG CCAAAGTGCT TATCTGAGGA AATATCTTCG GTAATTAGCA TGTACGGCGC 
GGTGGCCAGC CGCCCCTGCT CCCACCGGGC GTCAAACTCC ACTTTTTCTA TTGGGCTAAC 
GGTGAACCCC ACGTGGTACT GCATTGCCTT TTCGCGTAgc AGATTGTTAG CTTTGTCGGT 
GTTGAAACGG TTAGTGGTTC CGTACACCAC CGCGTACGGC TTGAGCCAGG CGTGGGAGCC 
GAGCGCAATC TGATAGCTCA GCCACGTTTT GCCCACGACC GGCAGgTTGA TAGGCCCCCT 
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CATTGTAGTC TTCTTGTAAT CGATGCCCCC GTTGTTTACG TAGGAGGTGT AGGTGAAGGG 
AATGTACAGG CGCGCCTCCA CCCCCGCGTT CAAGCCGGTG AGCAGGTGCG TGTAGGGGTC 
GCCGGACTTT GTTTCTAATT TTAAGAATGC GGCGCAGTCC AAGTAGTCTG TAAGCTGCCT 
GGAAAAGACG CGCTTGCCAA ACATATTGGC GCCGGCGGTG GCAAAATACG CGCCGGAAGA 
AAGCCACTTC CACTGCATGC TCAGCAAGGC GTCAgcGcCC AACTCGTGCA CGTGCAGGCG 
GTGCAGCCAC GCAAAGAACG TGACGAGCGC AATGGCCGGG TCCGGCTTTT TAAGGAGCTC 
GGCGAATCCC TGGGTGAGCT GGTTTACCAC CGCTCCCAGG CCCAGTGTGC TCTGAAGTAG 
ATTGGAGAGT TTTCCTTCCA AATGGGAAAC CACcAGTGCC AcTTGCGCTA TGAaGGCAGC 
GCCCGcAcTT TTTTGACAAA AAAcTCTCCG ACGCCACCGT TGGCCATaTC TGGGCCATTG 
TGCCCAAAAT cGGTTAACGy TGCAtTmAGG CTTcTGCCGm CTTGGsCTGC AgTTTACCGT 
CTTCCCTATA TCCGCGTCGC TCCGGTGAAT GTTTCCCACA TCAAGCGCCA GCACGAGCCG 
AAACCCGTAC CCCGGGGTCA GCGTCAGCCG TCCGCCGGCG CTCCACAGCA GGCGGTTTTT 
CTGCTCATTG TTTTTTCCCT GCTCAGTCCC CGTGGTGTAT TGGGGCTCCA GCGTGGCGTT 
CCCCGCCAGC TCCATCTTGA TGCGCTCAGC GCGGTGGTGC GTGTACGTAA GCGTGGCGTC 
TGCTCCAAAG CCGTACTTGC TGTGCGCAGT ACCACTATCC CACATACCGT TTGACGCAAA 
CGAGAGCAAG CCCACGTCCA ACCCAATGCC AtGCCGCCGA TATCCTGCGC ACGGTAGCCG 
AGCTTGCCCC CATAGCCGCC AAAGCCGGGs CATAGCGCAC GTCCTCCTGC TTGTAGTCGC 
TGGTCACGAA CGGGTCCCAC aG t GCGC AAA GTTAATAAAG CAGTTCGGGT CCTTGCCAAT 
GGTCAGGTAC GCGTTGTAGC AGTGGAGCGT TGCTTCAAAA GACGCTTTGG GTTTTTTGAG 
CGTAAAGGCC TGCCCCGGCC TGGGGGATTC AAAATCAACG GTCAGGTCCT TGAGCCGCAG 
TCCGCCCACA CGCCTGAGCG CGCCCCGCCG CGACGCAGGT GCGTGGCCTT GGGCACGAGG 
GGGAGCGAGA TTTTCAAATC ATTGGTGGTG CGAAACCCGT GCGTGTACTC GTTnnTTGCA 
C 

(2) INFORMATION FOR SEQ ID NO: 96: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 658 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96: 
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GTCAATTGCA TTAAGGTTAT CACGCGCGCC CACACCCTCG GCTATTGCGG CGGGGTGCGT 

ATGGCGGTGC GCATGGCAGA ACACGCCCGC GCCCACCACC GCGGGAGtTC TACACGCTCG 

GTCCGCTCGT CCACAATCCC GTGACGCTCG CCCGTTTGCG CGCGCGTGGC ATTGAGTGTC 

TGGATCCTGC TCATCTATCT TTTGCGCTGC ACGCTCCTGC GGcACCGGGC GCACGCCGCA 

TGCAGTGGAA GAAAAGACGG CGCGTACCGT GGTGATTAGA GCGCATGGGG TGGCACCTGA 

GGTGTATGAG GCCCTCGAGC GTTCCGGAGC GCAGGTGGTG GACGCCACCT GCCCGCGAGT 

TAAGGAAAGT CAGCGGCGTG CTCAGGGTTT TGCCGCGCAG GGACTGCACG TTATTCTCGC 

CGGGGACCGC AATCATGGGG AAATCG TTGG CATCGAGGGG TATGTGCGCG CGGGAGCTGC 

GCAGGCGTGC ArCCACTTGC CAGGCGGCGC ACCAGACGGC ATGCTGCCAC AGGTGCAGTG 

CTTTGTGGTG CAAAACGCGC GTGAgGCTGC CGCGTTGCCG TGTTTAgCGC GTGCAGctTC 

CTTgCCCAAA cTACCATTAC ACAgGGTGAA TAmGACGCGA TyGCCGCTGG GGgCGTAA 

(2) INFORMATION FOR SEQ ID NO: 97: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 763 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97: 
TTCTGCAGTG TACGTTAAAT CGGACGCATA CTATGGGAGG TGAGCATTGA T AAGGCG TAG 
ATATCGTGGT TGTACGCAGG AGCGTGGATA GTAAGTGTTG GTATGCTATT TGCATCGTGC 
ACTTCAGGGG CGTGGAAGGC ATCAGTAGAT CCGTTGGGGG TTGTGGGATC TGGTGCAGAT 
GTGTACCTGT ATTTCCCTGT AGCGGGGAAC GAGAATTTGA TTTCTCGTAT TATCGAGAAC 
CATGAGTCAA AGGCAGATAT TAAAAAAATA GTGGACAGGA CTACCGCGGT ATACGGTGCT 
TTTTTTGCCC GATCAAAAGA GTTTCGTTTG TTCGGAAGCG GTTCGTATCC ATACGC CTTT 
ACTAATTTGA TTTTTTCTCG ATCCGATGGC TGGGCATCTA CGAAAACGGA ACACGGAATC 
AC GTACT ATG AAAGTGAACA TACGGACGTT TCGATTCCTG CGCCGCATTT TTCCTGTGTG 
ATTTTTGGTT CCTCCAAAAG GGAGCGGATG AGCAAAATGC TGTCTCGGCT CGTTAACCCC 
GATCGACCGC AGTTACCGCC TCGCTTTGAA AAAGAATGTA CGTCGGAAGG TACGAGCCAG 
ACTGTTGCAC TCTATATAAA AAACGGGGGA CACTTTATTA CCAAACTGTT GAATTTTCCG 
CAGCTTAATT TACCACTTGG GGCAATGGAA CTGTACTTGA CCGCGCGGAG GAATGAGTAT 
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CTTTACACGT TGAGCTTGCA GCTGGGGAAT GCAAAGATAA ATT 
(2) INFORMATION FOR SEQ ID NO: 98: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4968 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
<D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98: 
GGCCCCAATC CCATAAAATA CTCTGGTTGT GTTTGCTGCG TATGACGTGA TAGTCAGCTC 
TTCCACCGTA AATGAGATCG CAAATACGGG GGCGGGTGCG ATATACAATG AGCATACAAG 
GATCATGCAT TCGTATCGTA AACGGTACAG TTTGATCTGC GCTCGATTCG AGTACGGACT 
TTCTAC TACT CACAACCTTC AAACTACGGG AATCGCTTTT GTCGACAATT AAAAGTTGAT 
ACATAACGGA GTGAAATACT TCACCAACCG ACAGTATAAC CCCATTGTGT TTTCTAGTTG 
TGTGAGATTC AAATCCCAAG GATGGCACGG AACTGAGTAC GCGCCAATAC ACCTGcCTTA 
ATCGTATAAA GCTTGCAGGT ACCACCACGT CGCTAAAAAC AAATACAAAA AGCGCACTGC 
ACAAACCAAT CGACAAAAGA GGGAATATCA AAAACCGCGG CGAAACGCCC AGCACTTGCA 
GCGCAAgcAG TTCAAGGTTG TTACGCAGCC GCCCCAACCC CAAGAGGGAA CCGAcCAACA 
CGGCAAAAGG AGTAACGTAA ATCAACGCAG CGGGAATGGA ATACAGCAAC AGCCGGACCA 
CGTCCGCAGC GCTCACGTGC TTGGTAAGTA ATGTCTGCGC AAAAAGCAGT ACGTTGTTCA 
CAAAAAAAAC CAAGAAAAAG CACAGCACGC CCACCAGGAT GTGTTTGAAC ACTACCTCGC 
ACACGTACAC AAAAAGCACT CTTCTCCACA TCTAGGTAAC TACAGAACCC CTGTGGATCC 
GTACCCTCCT TTTCCCCGTT CAGTCAGCGA GATACTACCG GAGCGCACAT ATTCAACACG 
GAGCACAGAA GAAATCACCG CCTGCGCGAT ACGATCTCCA TGGGAGACTA CGAAAGCAGC 
TAGCCCAAGG TTGACAAGCA ACACGCGTAT TTCCCCCCGG TAGTcAGCGT CTATCGTTCC 
CGGGGAATTT AAAACCGTCA CTCCGTATTC GAGTGCTAAC CCAGAGCGGG GACGAATTTG 
CATCTCCAAC CCCACAGGAA GCTCCACACA AACACCCGTG GGGACGAGAA CCCGGCCCAG 
GGGATGAACC TCAAGCGGTC CTCCGGGAAG AAAGGCCCGC AAATCGGCTC CACTTGAGCC 
TAACGTCTGG TACTCGGGAA AAGAAGCTCC CGGATACACG ACAGCTCGCA CACGGATCAT 
TTCGTCGTGT CCAGACCTCC AGACTTCCCC TCAAGAGCAT CGATATAGGA AAGATTCAAA 
CGACCCATCC TGTCGATATC AATCAACTTC ACACATATCC GCTGACCCTC TTGCAGCACA 
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TCGCTGACTT TGGACACGCG GCTGCGCGAC AGctTTGAAA CGTGGCAGAG TCCTTCCTTC 
CCTGGAAAGA TCTCCACAAA AGCACCGAAC TCTACGATTC GTTTCACTAC ACCCTGATAC 
ACCCTCCCTA CCCGAGGATC TTCAGTAAGG CCCACCACGG CGACCTTTGC GTCGAAAACG 
GACTGCGCAT CCCTTCCGGA GACGGTTACG GTACCGTCAC TATCAGTGTT GATAGTCACC 
CGATACTGGT CAGAAAGCGA CTTAACGGTT TTCCCCCCAG GACCGATGAG CGCGCCGATT 
TTTTCAACCG CTATTTTAAA ACTCTCAATA TGCGGCGCAT AGCGAGAAAT GTGcACGCTC 
GGTGCGCTGa TTGTCTGATT CATGAC AGAA AGAATATGGA GCCTACCTAC ACGAGCTTGC 
TGCAAAGCCT CCTTCATCAG AGACGCAGAC ACCGCCTCTA CCTTCACATC CATCTGAAAA 
CCGGTAATGC CGTCACACGT ACCTGCTACC TTGAAATCCA TATCACCGAG ATGGTCCTCC 
TCACCCAAAA kaTCCGAAAG AATCGCATAT CGCACGCCAT CGGTGATGAG CCCCATCGCG 
ATTCCCGCAA CAGGCTTTTT GATTGGGACC CCTGCATGGA GAAGAGAAAG CGTCCCTGAG 
CACACAGTCG CCATGGAGGA AGATCCATTC GACTCCAAAA TTTCTGAAAC CACACGCACG 
GTGTAAGGAA ACTGTTCTGG ATCCGGAATG AC TGCCG AGA GGGAACGATG CGCTAGACAC 
CCGTGCCCAA TCTCCCTCCG ACCAACCCCC ATTCTCCCTA TTTCCCCCAC TGAAAAAGGA 
GGAAAATTAT AGTGAAGGAT AAAATTCTCC CGTCTATCCC CTTCGATGTC GTCGTACACT 
TGCCCGTCCG ACATAGCACC GAGCGTGACC ACCGCGAGCG ATTGAGTCTC CCCCCGGGTA 
AACACCGCAG ACCCATGCGG ACGCGGCAAC ACCCCGACCT CACAGGCGAT GGGCCGAATG 
GCATCAATGG CACGGCCATC GATGCGCAAA CCCCTGTCAA GAATGTTCAG CCGTAGTATC 
TCATACTCCA TCTCGTGGAA CAACGCGTCG AACAACCTGC GCTGCACATC GTTCTCAAGC 
TGAGCAGCAT ACTGCTGTGC AACATCACGC TTCAcCGCGT CGCAGGCACT GcgCCGCTCA 
CCCTTCCCCT GTGCATACAA AGCCTGCGCA AGACGCGGAT AGGCGAGCTC ATAAATACGA 
TCGCGACCTA CAAGcTGCGC AGAAGAAGGG ATAACCGTCT GTTTCTCCTT GCCACACAGT 
CCACGCAGAC GCTCCTGCAT ATCGCAAAGG GCTTTAATAT GCTCTTGTGC CTGTTCGAGC 
GCGCCGAGCA TGAGGTCCTC GGACACCTCT CGCGCACCAC CTTCCACCAT GGTAATGCCC 
TGCCTAGTGC CTGCAACGAC AACCTCCATA CTGGCGGCAT CAATCTGAGA AAAGGTAGGA 
TTAATAACAT AGGAACCGTT CAGATATGCA ACGCGGACTG CAGCAACCGG TCCATGGAAG 
GGGATATCCG AAAGAGTAAC GGCAGCTGAA CTGGCAACAA TAGCCAAGAC GTCATGAGGA 
TGGACCATAT CCGACGATAT GCACGTAGGG ACAACGTGTA TATCACGTCC AAACTCCTTT 
TCAAAGAGCG GCCGCATCGG ACGATCAATG AGGCGCGAAA TGAGAATCTC TCTGTCTTTC 
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GGACGGCCTT CACGCTTGAT 
TTGAAGTCAA CAGTGAGCGG 
ACGGTCGCGA GAATCGCCGT 
GCCAGATACC CACTTTCAAG 
TTCATACGTT TCCTAAGAAC 
CTGCGAAGCT AGACAAAGAA 
CACGCACTGC GAAACTTCCC 
AGCGAACGAT ACGCCCCCAT 
CCCACCAGAA CGAGCAACCC 
CAGTCAAcTG CCTAATCCTC 
CCTTTTCTCC TGAGCCGTAC 
TTCTCCTTCT CCTTTCCTAC 
GCGCC TCGGT ACCGAGCCTC 
ACCCACCCCA CTCCCCTGTA 
CTCACACGCC CACGCTCGGn 
ATTACCTCCC CGTCAAGCGA 
GCGCAGCGcA cckCACGGGT 
GACCCACCGC GTCaGACAAA 
GTGTCCAGCC CATGTCCACA 
GCACGCAACA GGGTGTTGAA 
TCAATGAGCA CGACGAAATC 
GTTGACAGAT CCCCCTCGTA 
ATCAGCCCGG TGCACCGCGC 
TCGAAGAGAA AAGCATGTCC 
CCCCGATCAC ACGCTATACA 
GCCTCACTCC GTACAAAATA 
TCCAAGACAG CGACCCTCGG 
GACTGGAACC AACATGCGCG 
ATGCACTGAA CACGCCGTAA 




GAAGCCGCCA GGCATCTTCC CCACCGCATA ATACTTTTCG 3120 

GACATAGTCG AGCCCTTCCT GTCGCTGAGC AGAGGAGCAT 3180 

ACCTTCACAC TGTAAATACA CGGACCCGTT CGCTTGCCGC 3240 

GAGAAGGGGG TGGTCCCCAA TAGTGCCGGT TATGCTGTGT 3300 

AGAATGTATC GCAGGCCGCA CCAAGCCCTG GCACAGCCAC 3360 

AGCAGAACGG ACAACCGTCC TATGCACAAC CCCTCGGCGG 3420 

CATGCAAAGC CCCTACTTCC TCAAACCGAG GCTTTTGACA 3480 

ACTCACGCGC CGGGAATATC TCAACAGGCG ACGACGACGC 3540 

CCGGTTGCTA CTTTTGTCtT CGGATGAACC TTACAGTGGT 3 600 

TCAGTGAGAA GAGCGATCTG CACACTCGAA GATCCTGTAT 3660 

TGCTGCACTA CCGAAGCAGT ACGTTCCTTT GTCAGTGCCA 3720 

TCGAACTCGA TAGAGTCAAG CGCCTCCCGC CCATGCGACA 37 80 

AGCTGCACGA GAGAGGAACC CACACTAAGT GCGCACGCAG 3840 

CAAGGGTACC GCATACTCGC CCTCAGGTGG AAGCGTCTGA 3900 

GCACAACGTG CGGGCACACC CCyCGCTCTG CTGCCAGGGA 3960 

AAACGCACGC CCCAGCAGAC GCCGCGCGCT CTCAAAGTCT 4020 

ACCGCGCTCG AACTCACCCG AACTCCCTCG AGAGTGTAGT 4080 

GCTATGCGCA TCACCAAGCC GACGCAGCTC ACGGACCCCG 4140 

CCGAAAGTCG ACACCAACTG CGAGGTAGCA CACCCTCACT 4200 

GAAGACACCA CCCGGGATTC TAGCAAAATC CTTGGAAAAG 4260 

AAAGCCGCGC GCACGAAAAT AACGGAGTCG CAAACGCAGC 4320 

AGAAGAGGTT TTGTGCTTCC TGGGAGGATG AGTAAAGGTG 4380 

CCGATCGGCC ACAGGAGCGC ACGCAGCGGC AAACACCTTA 4440 

CCGGTGAGGA CCGTCAAACC CCCCCACCGA TATTGCTGCT 4500 

TGCACCCTCC TGCAACTGAG ACCAACGAAA AATGCGnCAC 4560 

CCGCCATCAT ACGAAAACCC ATTCGCCGCT TTCCTTATTA 4620 

AAAACACCAA CGCACGTTCC CCGGGCTTCG CGTACCAAAC 4 680 

TATTCTTTCC CCATGCAAAA ACCGCGAACG CATACTGCGC 4740 

GTCCAATACG ATTCGCAAAG TCCACATCAC AGCTAACCAC 4800 
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TGCCTGCTTA ATCTCACGCA CCGTCAAGTC TTCACACCCG AAAGAGACTG AATCGGTCGC 
CACGCCGCCT GAGGACGGAG GCGGTGGAGC ACCCACATCA AAGCAAGCGG CACCGCAAGA 
CGCACGCTCT TTGCCCCATG TCCAGGACCC GAGACGAACA CACCCGCC 
(2) INFORMATION FOR SEQ ID NO: 99: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6086 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99: 
AGTTTCGCGC GGGCTAAAGT GGGTTGGTCG CGTGTGGGTG CCAGCGCGCT GTACTTGTCG 
CGCGTTATTT CGTGCATTGA GCGTTCTAAG GTGGTTTAGC GCTTATGAAG GAGAATTCTT 
GCACGGCGTG CAGCAGACGG CTCGCCTTGT TCGTGGGCGC TGCGGTGCTT GTGGTAGGCT 
GTTCATCCAA GACGGATGTC ACGCTCAACC GTGACAAGCC CCTAGTGTTT TTTAACCGAC 
AGCCCTCTGA ATCCCTCACG GGGAAGGTTG ACATGGCTGC CATGAACTGG AACGACAAAA 
CCTATTACGT GGGTTTTGAC GCTAAGTTTG GTGGTTCTAT ACAGGGAAAG ATGATTC TAG 
ACTTCCTCGC CTCTTCTGAG TCCTCGGTTG AC CGC AACGG TGACGGCATC ATCGGTTATG 
TGCTTTGCAT CGGTGACGTC GGGCACAATG ATTCGAAAGT CCGCACCGAG GGTATTCGCC 
GCGCGTTGGG CACGTGGACC GGCTCCTCGG ATCCGGGACA GGCGAAAGAA GGCCAGGCAG 
TGGTAGGAGG GAAATCCTAC AAGGTGGTGG AGCTCGAGGG AAAGGCGATG ACGGGAACTG 
ACGGTTCCAC TTGGAATACG AATTCTGCAA CCGAGTCAAT GGGAAGCTGG GTGGCAAAgT 
TCGCGGATAA GATAGACCTG GTCATCTCAA ACAACGACGG GATGGCAATG GGCTGTCTGC 
AGGCGTCCAA TTATCCGCGG GGGCTGCCTA TTTTCGGATA CGACGCAAAT GCGGACGCGG 
TCGAGTCGGT TGGTAAGGGT GAGCTCACGG GGACTGTCTC TCAGAACGTC GACGCGCAGG 
CTGTTGCAGT GTTGCAGATT ATCAGGAATT TGCTCGATGG CTCCAGCGGG GAAGATGTGG 
TCGCCAACGG TATTTCAAGA CCTGACGCCC ATGGCAACAA GATAAGCGCG CCCGTGCAGT 
ACTGGGaAGA TGTTAAAGCG ATTATGGCCG ATAACTCGGA GGTCACGAgC GCmAACTGGA 
AAGAGTACAC CAGGGGAGCA CGGGATGCAG GGGTGCGACA GGTAAGTGCG CCGACGAAAA 
AGGTGCTGCT CACTGTCCAC AACGCGAGCA ATGATTTCCT TGCTTCTGCC TATCTTCCCG 
CACTGAAGCA TTACGCTCCG CTCCTGAATG TCGATCTCAC TGTCGTGCAG GGCGATGGCC 
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AAAACGAGCT AAGTTGCCTT GATAAGTTCA CTAATCTCGA CATGTTCGAC GCGTTCGCGG 
TaAACATGGT AAAAACGAAC TCGGGCGCTG ACTATACAGA CAAGCTCAAA TACTGAGCAG 
CCGGGTTTGG ACGTGCGTTG GGTAGCTGCT GTTCCTGGTG CACGTCCTGT TCGTTGAATA 
GGTAGGGTCT ACCGACCTCG CACCGCTTTC GCGCGCGAGA GG AG TG AT AG TTGCGATGTG 
CGATGTACTC ACCATAAGGG ATCTTTCTAA GTCTTTTGCG AGGAACAGGG TTCTCAACGG 
GGTGAACTTC CGTATGGGAA AGGGTGCCGT GGTGGGGCTT ATGGGAGAAA ATGGTGCGGG 
AAAATCCACG CTtATGAAGT GCCTCTTTGG AATGTACGCT AAGGACACTG GTCAGATTCT 
CGTGGATGGA AGCCCGGTGG AC TTTC AG TC TCCCAAAGAA GCGC TAGAAA ACGGTGTCGC 
CATGGTCCAT CAGGAGCTCA ATCAATGCCT TGATCGCACT GTCATGGACA ATTTGTTTCT 
CGGCAGGTAC CCTGCCCGTT TCGGG AT AG T TGACGAGAAA CGCATGTTCG ACGACTCCCT 
CACTCTGTTC GCTTCCTTGA AAATGGACGT AAACCCGCGG GCCGTCATGC GCAgcATkTC 
TGTcTCTCAG CGGCAGATGG TAGAGATTGC CAAGGCGATG TCCTATAACG CGAAGATTAT 
AGTC CTCG AC GAGCCTACTT CCTCTCTCAC GGAGAGGGAG ATTGTCAGGC TCTTTGCCAT 
TATACGAGAC CTGAGCAAAA AAGGAGTGGC ATTCATCTAT ATCTCCCACA AAATGGATGA 
GATCTTTCAG ATCTGCAGCG AGGTGATTGT GCTGCGGGAT GGTGTCCTCA CGCTCTCACA 
ATCCATAGGG GAAGTGGAAA TGAGCGACCT CATCACCGCT ATGGTCGGGC GCACTTTGGA 
CAAGCGCTTT CCCGACGCTG ACAATACCGT CGGTGACGAT TATCTTGAAA TACGAGGTCT 
TTCTACAAGG TATGctCCGC AGCTGCGGGA TATTTCCCTT TCTGTGAAAA GGGGCGAGAT 
TTTTGGCTTG TACGGGCTGG TCGGTGCGGG GAGGAGTGAA CTGCTTGAAG CGATTTTCGG 
CCTGCGTACC ATCGCAGACG GTGAGATCTC TTTAGCAGGA AAAAAAATTC GCTTGAAGAG 
CAGCAGGGAC GCAATGAAAC TCAATTTCGC CTTTGTGCCC GAGGAACGTA AGCTCAACGG 
AATGTTCGCA AAGGGGAGCA TAGAGTATAA CACCACGATT GCAAATCTCC CTGCGTATAA 
GCGTTACGGT CTACTCTCAA AGAAAAAGCT GCAGGAGGCA GCGGaG sGGG AAATAAAGGC 
CATGCGCGTG AAGTGCGTTT CTCCAAGCGA GCTTATCAGT GCGCTCAGCG GGGGTAATCA 
GCAGAAAGTC ATTATTGGAA AGTGGCTCGA ACGCGATCCC GACGTCCTCT TGCTTGATGA 
GCCGACCAGG GGGATCGACG TGGGTGCGAA ATATGAAATT TATCAGCTCA TCATTCGTAT 
GGCGCGTGAG GGAAAGACAA TCATTGTGGT TTCTAGTGAA ATGCCTGAAA TTCTTGGAAT 
CACCAACAGG ATCGCAGTCA TGTCCAATTA TCGATTGGCT GGGATTGTGG ATACAAAGAG 
TACCGATCAG GAAGCCTTGC TCAGACTTTC TGCGCGATAC CTGTAGGGAG GAGCAGATAC 
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ATGCGCGATC GTACACAGTG TGTGGCGGTG CCAACTCAAG CGTTCAATGA GATTTTAGAT 
CAGGACGGTC AGCTCACCGC GTACGCCCAA AGGCTCGAGC AGTTACGAGA GCGCGGTTCC 
CATAGGGTTG CCTTGCTCCG CGGGGAGCTT GCGCGCATAC GGCAGGATCA GGTCTTGGGC 
ATGCCGGAGA AAAGGGTGCA GGTTGCGGCG CACAGGCTCA AGATTTCCGA AGCGCAGGCC 
GTTGCACGAC AGTrmAAAAC TGAGGAAACG CAgTTGGtTA GGAArGsTGT CGCGCGTGTA 
AgGGGGCTCt TTCGAGACTT TGACTGCTCT GTGCGCGACG CGATGCGCGA ACAGCGGCTC 
TTGCTAAAGC AAGTTGCGAC GGTGCaGCAC ACCTCTGCCT CATCTGACCA AAGAGAGCAC 
TGTCTGGCTC AGCTCCGGCA ATGCmAGGAG GCGCGACACC ACGCCTACCG TTCCTTGGTC 
GAAAAGaGCt GCGCTGCGGA ACGGGAAAAT GaCGTTTATC GAGCGCGTGG TGCGTGCTCT 
TAGAGAATAT TCGTTCAATT TTGACGCAAC CCAGTTCTTC CTCGCAAATG GTTTGTACAT 
TGCTATTGCG GTATTCTTTA TTGCGTGCAT CGTAgTTGCA CCTTTCTCTG GTAATGGCAA 
TCTTCTTACC ATTCCCAACA TTCTC AC CAT ACTGGAGCAG TCTTCAGTGC GCATGTTCTA 
TGCGGTGGGA GTAGCAGGTA TTATCCTGCT GGCAGGAACT GACCTCAGCA TTGGGCGTAT 
GGTGGCAATG GGGTCTGTAG TCACGGGTAT TATTCTTCAT CCGGGACAGA ATATCGTTAC 
ATTTTTTGGA CTGGGGCCGT GGGATTTTAC CCCTGTCCCC ATGGCTGTCC GTGTAGTCAT 
GTCACTTGcA GTTTCTGTCG CACTTTGCGT TTCGTTCAGC CTATTTGCAG GATTCTTTTC 
TGCTCGCCTC AAAATACACC CTTTCATTTC AACTCTTGCA ACGCAGCTTA TCATCTACGG 
GGTTTTGTTT TTTGGGACAA GTGGTACGCC AGTTGGCTCT ATTGACCCAT ACATCAAAGA 
CCTATTCGGT GGGCGGTGGA TTCTAGGCAC CATGCAGGGC ACACTCGTGA CCTTCCCAAA 
GCTGATAATT CCTGCCACCA TTGCGGTGGC CATCGCGTGG TTCATTTGGA ACAAGACGAT 
TCTAGGAAAA AATATGTACG CCGTTGGAGG GAATGCTGAG GCAGCGAATG TTAGCGGCAT 
CAGTGTTTTC GGGGTGACTA TGAGCGTTTT TGCAATGGCA GCTGTGTTTT ATGGCTTTGG 
CGCGTTTTTT GAGACGTTCA AGGCAAATGC AAGTGCGGGC ACTGGTCAGG GTTATGAGCT 
CGACGCAATT GCCTCCTGTG TGGTAGGGGG TATCTCCTTC AACGGGGGAA TCGGAAAACT 
CGAGGGTGCC GTGGTAGGCG TAATCATTTT CACCGGTCTT ACCTATTGTC TGACTTTTTT 
AGGCATCGAT ACAAATCTTC AGTTCGTGTT CAAGGGTTTG ATCATCATCG CTGcAGTTGC 
ACTCGACAGT GTGAAGTATC TGAAACGCCG CTAGTTCTTG CCCcGCTGGG CGGGAcGTCA 
ACGTTCACAA TACGAATAAG CCGGGCGCCT TTCTGGGcCA TTGTTCCCTC TTTGGCTAAC 
TCAGGGTGTG GGCTGACAaG AAGGCcTCCG CTGTCCGAGC TCTACCGTGC TTCAGATGAG 
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CCcTTTtCTT TTCTC AG TAG TTCGAACGnc yTCGCGCGCA ACTTGGAGGA TAGGGTAATC 

TCTTACTGGA TCCGCAACCC G AAATC C ACT GTACCCAGAT TGCTCAAAGT TCTTGGTATC 

TCCCACATCC CcTGGTCCTC GTAACTTCAG ATCCTCTTCG GCGATAACAA ATCCATCCGC 

aGTACTTCCC ATAa TTTTC A GCCTGCGTTT CGCACACTCA GTCATTTCGT CCCCATGCAT 

TAaGAAACAA TACGACTGCA CATCACCCCG ACCAACCCGA CCACGCAGTT GATGTAGTGC 

AGAGAGGCCA AAAACTCCGC gTGCTCTATA ACGATACAAT TCGCATTTGG TACATCCACT 

CCCACTTCAA CAACGCTTGT AGCAACCAAG ATATGGACGG TACCTTCGCT GAAATACTTC 

ATGATACGCT GCTGCTCTTC CTCAGTC ATT . TTTGAGTGAA TCATCGCAAC AGCATATCGT 

GCAAAATAAT TTTTTAGATA CATATACATA CATTGCACCG ATTTT AAATC GGTTAATCCT 

ATGTCATGAA TACGTGGATA AATAAAATAT GCCTGCCTAC CTTTTTCTAT TTCATTTCCC 

ACAAACTCAT ACACCTTTTC TGCTTTCGTC TTTCTTGCAA TATACGTAAT CaCTGGTTTT 

CTTCCACCAG GCaTAGATTT AATTATTGAA ATATCTAAAT CACCAAATAC AGAAAGTGCa 

AGCGTACGTG GAATTGGAGT TGCGCTCATC ATAATAATGT GTGGAGTCTT TCCCTGAGGG 

TTCCCTTCCC TTCC TTTCTG AATCAAGGCC GAACGCTGTA ACAC TC C AAA ACGATGCTGT 

TCGTCAATGA TAAC CAACCT CAGATCATGG TATCTTACGC TCTTTGAAAA CAGCGCATGT 

GTTCCTACAA CTAAATTGAT TTCTCCTGCA ACAAGAGCTT CGAGCAAGTA CGCCCTTCCT 

TCACTTTTCA CATTACCTGT CAGGAATGCA AGTCGAATCC CAATAGGAGC AAG TAATCG A 

GCTGCaGTGT CAGCATGCTG GCGTGCAAGT AATTCAGTTG GAGCAAGCAG TGCGACCTGT 

CCACCTTGTT CAATAATTTT TAAACAAGAA AAAAACGCCa CTAACGTTTT TCCTGATCCA 

ACGTCTCCCT GAATTAGCCG TGCCATCGGT TCTTCTCTTT CAAGATCCTG CGTAATTTCT 

GTAATTACTC TTTTCTGATC CACTGTCAAC TCAAATGGCA AACACCGGTG AAGTTTCTTC 

TGTAACAAAG ATAATTCAGA AACAACTGAC GGAATAGCCG ACTGCTGATC AGATTCTCCC 

TGTGTAAGAG GCAATCTCCC CCGCTTCTGT AAAGAGCGCA TACCGATAGT CATTTGAAGA 

GAAAAAAATT CTTCAAATAT CAAAGA 

(2) INFORMATION FOR SEQ ID NO: 100: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20757 base pairs 
<B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100: 
GCAGCGGCCG GTGTGGGCGC CCACCGGCGG GCGGTATGCG TCTTTGGACG GTGCGTTTAC 
CGCGCTGGCA ATGATGCAAG TTTCTTTGAG GCAAATCCGG CAGGAAGTGC GAACATGACG 
CACGGGrAGC TGGCTTTCTT CCATACCACT GGCTTTGGCT CGTTTCACGC CGAAACGCTC 
TCTTACGTTG GCCAGTCGGG CAACTGGGGA TACGGCGCGT CGATGCGTAT GTTTTTCCCT 
GAATCTGGGT TTGACTTTTC TACCACCACG GAGCCCGTGT GCACACCTGC TTCGAACCCC 
ATTAAGCAGC GCGrGGCAAT TGGAATCATC AACTTTGCCC GGCGTATCGG AGGTCTCTCC 
CTGGGAGCCA AC CTG AAGGC GGGGTTCCGC GACGCGCAGG GCCTGCAGCA CACCTCTGTC 
TCCAGTGACA TCGGCTTGCA GTGGGTGGGG AACGTTGCCA AGTCCTTTAC CTCTGAAGAG 
CCCAACCTGT ACATCGGGCT TGCGGCCACC AACTTGGGAT TGACCGTAAA GGTCTCGGAC 
AAGATAGAGA AC TGC AC GAG TACCTGTGAA AAGTGTGGTT GCTGCAAGGA GAGGTGCTGC 
TGCAACGGCA AGAAGGCGTG CTGCAAGGAC TGCGACTGTA ACTGCCCCTG TCAGGACTGC 
AACGACAAAG GTACGGTGCA CGCAACAGAC ACCATGCTGC GTGCAGGGTT TGCATACCGG 
CCCTTCAGCT GGTTCCTCTT TAGCCTTGGT GCCACCACCA GCATGAATGT GCAGACCTTG 
GCTAGTAGTG ACGCCAAGTC GCTGTACCAG AACCTGGCTT ACAGCATAGG CGCCATGTTT 
GATCCCTTCA GCTTCCTGAG CTTGAGTTCG AGCTTCCGCA TCAACCACAA GGCTAACATG 
CGAGTGGGAG TGGGTGCAGA GGCGCGCATT GCCCGCATTA AGCTGAACGC GGGATACCGC 
TGTGACGTCA GCGACATCAG CAGTGGGAGT GGGTGCACAG GCGCGAAGGC TTCGCACTAC 
CTTTCCTTGG GTGGCGCGAT AC TGCTCGGC CGAAATTAAT TCATAATATG CCGGGGCGCC 
CGCCGGTGCC CTGCTGAAGA ATGCGGaCGG cAAGACGTGG AGGGGGTTTT GCCGCTTTTT 
TGGTGCGGCG GCCGTGTGTG TCGGCTGCGC CAGTACGCGT AGGAGGACGA TTGGTGTTCG 
GTTTTGCACG CGTCGGTTCG CGCGGGCTCT GCTTGGGGGC CCTCCTGCTC TCCCyTCGCA 
TCgTGTTGGC ACAGCACGTT GCTGACGCTC CTTTGGGCGC ACGCGGGGTT GTTCCGCGCA 
GnTCCTTGCC TCGGCGCACG CGGGCGGCCC GGGCTACGAC GCTGCGATCT CGGGGCGGCG 
TGGTCAGTTC CgCGCGAGCG GGGGAACGCT CGTGGTCACC GCGCAGAAAC CGAAGGTCAT 
GGCACGAAAT GACGTGGACT ACCGTCCGCT CTCCCTGCAG GCCGGCGGCA GACAAGGCTC 
GTTGGACCTT GTTGCCACgC AACGGCGGAT GACGCCAGCT TCTTTGAAGC GAACGCCGcA 
GGAAGCGCCA CCATAcCGCG CATGACGCTC GCCTTTTTTC ACACCATGCG CATTTCCGAC 
TCCCACATAG ACGTACTTTC CTTTGTCGGG CGGGCGGGGC GCACCGGCTA CGGCGTTTCG 
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GCACGCGCCT TTTACCCAGA CATGTCCAGC AAAACCACCG GCTTCGTGGG AATTTTTAAC 1740 

GTATCGCACG CTTTCTCTTC CGCCTATCGC TTTAAGGGCG TGAGCGTGGG CGCAAACCTT 1800 

AAGGTGGGGT ATCGCCACAC CCGGGGGGGG GGGTAGCAGC CAGTCAAAGA GCTCCAACGG 1860 

GAAGGAGAAC CACCACATAG TCCTGACCGC GGACGTAGGG GTGCGCGGTG CGTGGACGGT 1920 

GTCTAAAAAC TTTGGTGCGC ATGAGCCAAA CCTGTGGGCA GGAGTAGCAT TCCGCAACAT 1980 

TGGCGCGTCA ATCAACGCCA CAAACCTTCA CGGAAATAAC GGCGCCGGAG GCAGCGGCGG 2040 

CGGTGGAGGG GGCAATGGCG ACGGGAAACC TGCCCACGTC ACGGACTCCC GCGTTATCCT 2100 

TGCGCTTGCG TACCAGCCGG TGCGGTATTT TCTTTTTGGC GCCGGGCTTG AGTGGCTCTA 2160 

CAATGTGGGG TCTATCAAAG CCGTCAATTC GCTCCGGTAT GGGGCGGCGT TCATGCTTTT 2220 

TCCGCTCAGG CAATTGGCAT TCAGCTCGAG CGTGGTTATG AAGGGGATGG GTCCACAGCA 2280 

GGTCCGCGCG AGCGCAGGGG CAGAAGTGCA GTTTTCTCAC GTGCGGTGCA CCGCCTCGTA 2340 

TTCGTATCTT TGGAGTGCGA CACCCACACG GCCsCACTAC GTTTCAATTG GGGTAGCCGG 2400 

TTTTCTCAAA CCGGTTCCCG AACAACCCCT GTGGCAAGAG GTGTACCGCT CCTATTTGCG 2460 

CGGTGCGCCA CTACCACGCG CAgCTaCGCA GAGGCCATCG CCGAGTGGAA GCGCACGCTG 2520 

CAGCAGGGCG TCAGTTTTGA GCCTGCGCGG GAAGGCATCG AGCGCGCCAC CAAGCTTTTG 2580 

CAGCTGAACC AAAAGGTTCA CGATTTTAAC ATTTTCTAGC CGCGCCGCCG cGCAtCATCT 2640 

GCTCCGTCCT GTGCCACGCT GCCGGCACCG GCGGAAAGTG GGGGCGGACC CCTTACTCGA 2700 

CCGTCACCGA TTTTGCCAGG TTCCGCGGCT GGTCAATGTC AGTCCCyTTG AGCAGGGCAA 27 60 

TGTGGTATGC CAACAGCTGC AGCGGTACGG CGTAAAAAAT CGGCGCGGTA AGGGGAGATA 2820 

CCGAAGtACG GTGACTATCT GACTGCACGC CCCTTCCGGC GCGTCCGCCT CGGGCGTGCA 2880 

TACCGGGCCA AAACGCTCCG GCACGTCCGT AAAGATGTAG AGCATCCCGC CGCGCGCGCG 2940 

AACTTCCTCG ATGTTTGAGG CCATTTTTTC AAACAGGACG CCAGGTGACG CCGGCGCGAT 3000 

TGCAACCACC GGCATCTGCG CGTCCACTAA TGCAAGGGGC CCATGCTTTA GCTCCCCCGC 3060 

TGCGTATGCT TCGGCATGGA TGTACGAAAT TTCTTTCAGC TTGAGCGCCG ATTCAATTGC 3120 

AATCGGATAC AATTCCCCAC GCCCCAAAAA GAGCgCATGC TGCGCATGCA CAAAATGCCG 3180 

CGCGCACCGc GCAACGTCTG CCTCACACTC AAGCACGTGC TCCACATCCT GAGGCAGTCG 3240 

CTGGAGCGCC GCAGAGAGCG CGTCCTCGGG CTCcTGCGTG AGTATCTTTT TTGCCTGCGC 3300 

AATCATGCGG GTGAGCACGA GCAAGCACAC CAGCTGGGTG G^AAAAGAtT CGTTGAAGCA 3360 

ACCCCTATTT CTGACCCCGC GTGGGTGAGC AGCAtGCGTC CGACTCACGC ACCAACGTGG 3420 
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AACGTGCCCC GTTGCAAATG GCAATCGCAC AGAGATACCC TTGCGTTTTT GCCAGGCGCA 3480 

GTGCGGCAAT GGTGTCAGCC GTTTCTCCCG ACTGAGAAAT CG TC AG T AC T ATTTCACGCG 3540 

CGTGCACGAC GCTCGTGCGA TAsGGTACTC TGAGGCAATC TCCACCTGAC ATCCCACCCC 3 600 

TGCAAATGCC TCAAACCAGT AACGCGCCAC TAACCCTGCA TGGTACGAGG TACCACACGC 3660 

GATAATGCGC ACCCGTGTTA TCCGTCTAAA CAGCCGCTCA AACGTCTTAC ACGAGGTACC 3720 

GTCCAAGACC CGGTCCTCCC CGAACGTCCG CACCTGTGCG CGAGAAGACG AAGAAAACGA 3780 

CATATAGGCA TTCAGCGTAT GCCGTATAGC GTGTGGCTGC TGCCATATTT CTTGATGCaT 3840 

ATGGTGACGG TGCGTACCCT TATCCTGCGT ACAAAGCTGC ATCTGATACG TAACAACAGG 3900 

ACGCGCCACA ACGTTTCCCT GCGCGTCGTG GACGCACACG CTATCTCGGT GGACGTCTGC 3960 

GATGTCTCCT TCCTCAAGAT ACAAAAAACG CTGCGTAACA TGCGCAAgcG CAAGs gGGTC 4020 

TGACGTAACA AAATTTTCCC CACAGCCGAG TCCTACCGCC AACGGACTGC CAGAACGCGC 4080 

AgCAATCAAC CGCCCAGGAG ATGCAGCGTC CATGCAAAGT AACCCGTAGt ACCCCGAACC 4140 

TGCGTCAACA CTTTTTTTAC CGCAAGCAGG AGGTGCGCCG TGTACCGCAA CTCCCAGTGC 4200 

AAAAGATGCG CGAGCACCTC GCTATCAGTT TGTGAATGAA AAAAATAGCC ACGGGTCACT 4260 

AGCATTTCAC GCAAAGACCG ATGGTTTTCA ACAATACCGT TGTGAACTAT CGCAACGGAT 4320 

TCAGAACAAT GCGGATGCGC ATTCGCTGCA CACGGCTTGC CGTGCGTTGC CCACCGGGTG 43 80 

TGCGC AATGC . CCATGGTCCC GCAAAGAGGA CTCTGACCTA ATAGCGCGCA GAGCGACTGA 4440 

ACACGACCCT CACAGCGTAA AAGGCGGAGc GCACAGTCCG AGCCAACGAC AGCGATCCCT 4500 

GCAGAATCAT ACCCGCGGTA TTCAAGACGA CGCAGCCCCT CAAGCAAGAG ACCTGAGACA 4560 

TCACGCCCCG CCACCATCCC AACGATTCCA CACATAGACG CCTTTTCAGT GCAAAAGCAC 4620 

ACGGGAGAAC GTTAAACTCA AACCCACAGA ACACACGCAC GCGTAGGAAC CATTCTGACT 4680 

CAACCAAGCC ACGAGCCGCC TCGCAGGCTA CGGTCCAGCG CACCAACAGG GAGACTCACC 4740 

ACCTCCCCCA AGGGGAGACC CGACGAACGA GCAAGGTAAC CCACAGGTCA CAGGCGATCC 4800 

CATGCCCATC CGTTGCCTTG CTGCACATCG CTCACTTTTT CGAAGCTGAA TCTGGGAGTA 4860 

TTTCCTGTAA CTCTATCTCA AAAACTAGGA GCGCACCTGG AGGGATAACC CCCTCGATAC 4920 

CACGCTCCCC ATACCCCAAG GAAGAAGGCA CATAAAACCG ATAGGTAGAA CCCACCGGCA 4980 

TCAGCTTTAA GCCCTCAGAT ACCCCAGGCA CCATACCATC CACCGGAAAC TCCGCAGGCT 5040 

TATCTCGAGA GGCATCAAAC ACCGTTCCAT CAAGCAGCGT CCCCTTGTAC TGAGTGCGCA 5100 

CCCTCTGAcC GCCtGCGGCT TTGGACCATC TGCAGCCTTT ACCACCTCGT ACTGCAACCC 5160 
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AGAGGAAGTT ACCTGCACAC CTGGCTTCTT CGCATTCTCT TCAAGAAAGG CCTTTGCCTC 
CTGCGAATTC TTCTCTACCT CTTTTTGCCG ATATGCCTCG AACGCACGCT GTAGCACGTC 
CTGCGCATCC GCAAGTGCCT GCTTATCCTT GTCTGCACTG ACTGTCTTCT TTAGACCCTT 
CCACACCTGA CCCAGGTCAA CGTCTAACTT CGAATCCTGC AGCGTCACCC CCATGAGAAC 
CCCAAAGGCA TACCCCACAC TCTTCTTTGA AAGCGGCGTC TTCTGcCGCT CCTGCTCCTG 
CACCTTTCGC GGATCGAGCT GATCAGCAGT CAGCGCCTTC TTATCCGCAG CCTCCCCGGC 
AGAGGACACC CCCTCAGCAC CCTTCCGACA GGAAAAAATA CTCATACCCG CAAGGAGCAG 
AAGCGCGAAA GAACACACCC CTGcAGCTTC TTTTTTCAAA ATCACCGCTC CTTTTGCGCG 
CTCATCCGCG CCGCCCCGAG GCGACCAGCC TACCCCATCC GTACTTCACA CGTCAAGGAG 
ATAGCTAATG CCCCGATCCT TCCTTGAGCC ACACCCACTT TTCAGGCATC GTACGCACCA 
TGAACCTCTA TTTGGATTGG GCTGCAGCTG CTATACCCGA AACCCGGCTT ATCCTACGCG 
AAACGCAACA GGCCTGTACT CTCTTTGCCA ACCCTTCTGC TGTGCATTGC TTGGGAAACG 
ATGCGcGCGA TGCGCTCGAA CGAGCACGAC ACACGTGTGC CCAGAGCTTA GGCATCAATC 
CAGAACGCCT TATATTCACT TCAGGCGGGA GCGAAGCAAA TCAACTTGCC CTCCTTTCTG 
TCCTTACTCG TCTTCCTCAC GCAGAAATCA GCGTCAGTAT GCTAGAGCAC GCGTCGGTCA 
CTGCACTTTT GCCCCGGCTT GAGCGGTTGC AGGTATCCGT ACGCCACATC CCCGTCAATG 
CCCGCGGTTT CATTACCCCT GAGGCTGTAC GTGCAACGCT CAGTCCCCX3T ACCACGCTAG 
TGTGCGTGAG CGCCGTACAT AGTGAAACCG GCGCCATCCA GCCGCTCCCT GCTATTGCGC 
ACGTGCTTGC ACATACAGGC ACACGCGGAC GCTCTATCCA GCTCCACGTA GACGCCGCAC 
AGGCCTTTGG GAAAATACCG CTCAATCTGT ATATGGACCT TCCGCGCATA GAGGAACATG 
CACAGGAAAA CAACGCGCCA CAGACACCAC CGGGCTACCC CGCACCCACT GCACAACGCG 
CGCTTACCTA CTCGGTAGCA ATCAGTGGCC ACAAAATAGG CGCACCACGG GGTATTGGGC 
TACTGTGCGC ACACCGTTCA TTTACCCCCT TTGTCCTGGG AGGCGGACAG GAAAAAGAGC 
GCCGCCCGGG AACTGAGAAC CTTGCAGGTG CGCTCGCGCT CGCCGCTTGC GTGCGCGAAG 
GCGCCTTCTT CCGTACTCTA CATACCACTC CGGAAGGCCC TACACCGCAT ACGAAGCCCA 
CAGCTCCTGC AGGGTTACGC AGTGTcCGAG CGCGTACGTG CGCCTTTGTG CGTGCACTCA 
GCGATTTACC GCGGGTGCAA CTAGTCCCTG CAACGCGCAA AGAAGACGAA GCGCACTTCT 
CTCCCTACAT CGTCTGCTGT GCGGTACAAG ACGCCAGCGG cGAGGCATTA GTACGTGCAT 
TCTCAGACGC AGGTGTGTGC ATCTCCACCG GTTCTGCCTG CTCGACAAAG AAAGGTGGCG 
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TTTCAACACG CCTTCTGCGT GCACTCGGGG TAGAATcCCg cGCAAgCGCG GCGtGCTGCG 
TTTCTCTTTT GGTCCACACA CCACCGCCGA AGATCTCGAT CGGGTCTTAA CGCTTTTCCG 
TACCCTGCTG CAAAAACTAT GACCGCTCCA CACAGAAAGG GTACACGCAC AGGAAGCACT 
CAAGCGCCGC ACACAACCCC TTATTTTGTC AAAATTTTTA ACGGATTTAC CACCGACCCG 
TTCTTAAATA CTGAAAAATG CAAATGCGGA CCAGTTGCCC GACCGCTAGC CCCCACGCGC 
CCAATGGTCG TCCCCTGGAC TACCCGCGCC CCACGCCCTA CCATCACCGA ACTTAAATGC 
CCATACATGG TTTGGTATCC ACCGCCGTGC ACGATGATCA GGTAATTCCC GTAAATTCTG 
CTATATCCAA TTTCTGCCAC TTTCCCATCG AGCGTTGCTT TCACCTGCGT CCCATAGGGT 
GCGGCTAGGT CAATCCCATT GTGAAAGCTC CTTTTGCCTG AAAAGGGATC TGAGCGGTAC 
CCAAACCCAG AGGTGCGCCG CCCGCGAATT GGATACATGA ATAACTCCCC CAACACCTTC 
CTCAAATCAA AAGCAGATAA TTTTGCACCC GGAATAAACA AACGCTGTCC AACTGTTAAT 
GCACGACTGA CTAAATCATT CGCATCCAGC AACGTATTCA GGGGCAAACG AAAGAGACTG 
GCAATTGCAC TAAGCGACTG CCCCTTTTGT ACCGTGTGCA TGAGTCCATC CATGGACGGA 
ATAGTAATTT GATCCCCTAC CGATAGTCTG CGCGCGTTTG AAATTCCGTT CACCGACAGC 
AGCGTCCCCA TATGCTTGAG TCCTGcGCgC ACAAtTGGcA CTAATAGTAT CTCCCTTACG 
TACGGTGTAC GTGCGGTAGc TCACCGTCGC AAGCTGCGTA TCCAACGCCT CTGCAAGCGC 
GGCAGGTTTC GCGCCGCCCT CGCGCGTTTC TACCTCCTCT GCACGAAAAA CCCCACGACG 
CAGCGCATGC GCCAGGTGCT GTCTGTTTCA AAACGCACCA GCGCAAgCCT TCTGCCACCG 
GTAACGTGCA mCGCCCACGG GATACACAGC ACCATGAGCA CAAAGAGCAG CGCGCAAGCC 
AACGCCGCAC ACAACGGCAC ACAAGAAGCA TCCTGCAAAG AACGTACAGG CACCGAAAAG 
GAAGGAAAAG AAAAACGAGG CACCcGGTGC GCGCAAGCGG CGTGCGTGCA CTCTTTTCTC 
TCCGGAAAAA ACGCATCCCC CCTATACaCT CCtGCGGCTC ACCCGAACAG AACGGTTTGA 
AATACTGCTG AGCAACCCCG AAGCGTTGAG CGCCTGCGCA CGTATCAAAC GCGCTTCTGC 
ACGTGTGCTC CATTTTGCGT AGCGCGCGCg TAAGCCCCGC CCTCCTCCGA GTCTCCCACT 
GACTCGGCTG CCTGACTCAG GAATATCGAG GGTATCAGGA CTTTCATGAA ACGAAATTAT 
CTCTTCCTCA AGGGGAAAAC GGGGAACAAG ACGGTTACGC GTACGAGCGC GCGCACCCCC 
GCTATCGGAA AACGCACTAC TTGAAAGAAC TGCGCGGCGC ACTTCTGCAG GGCTCGGACA 
CACGCTGAAC GGCTCACATC CCCGCGCATG ATCCGGAACT AAGTCCGCAC CCGTGTACGT 
AATCACGTCC ATAACGGGCG GTATCGGAAC ATTTATTCCA GGCGTTGAGC ATTTCGCCTG 
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CAGCgCCACA GGcGGCGGaG TCCCCAGCAG CGTGCCACTT GTTTTTGAAC 
ACGCCCCGTA TAAAGAACAC ACCTCGCCGG GCGCAACCGC CGAGGGTGTC 
AAGCAGAGTT CAGTGAAGAA AATGAAAATC GTAAACCTGC CGGTTTAATT 
CGTTGCACGC TCGATACCTT CTCGCGCCGG CTCAAAGCTA CCGGCACGCT 
GCGCCATTCG GCAATCGCTT CCTCATAGTG cTGCGCATCA TAGTGACGCA 
gTATGAAGTG TATACTTCCT TTTCCAAGTG CTTGCGTCGG TCTCGATTAA 
AATTCCGCAG G AAA TG AC AT GCTCATCCTT GTCTGACTCA TAGGTGTATG 
ACGCACCCAC GTACTTTTGA ATTCTACTCC TGCACTTGCA CGGATATCTG 
GGTGAGAAAC ACATTTGAGC CAAACGCAAC GTACTGCACC GGC AGGAGC A 
GCCGTATCTG AATCTATTGT TGTCTGCAAA TTCTTGCACG TTGTACTTCC 
CGTGCCAAAG AGAAACCACC GTATCGGTTG ATACGCACAT GCGAGAATAA 
TGTAGCGTGC ACCGTCCTGC CGCCACTCAT GCTAGAACCG CTGTTGGATG 
AAcTGAAAGT CCAACGTTTT TGACAGTACC GCCCACCCAC AGATTCGGCT 
AAAATTCTTG GCCACCGACC ACGTCCCCTG CAGACCGATG TCTGCGGTCA 
CTTCTTACCC CCCTGGTTTT TTTTATTCCT CTCCCCGCCG GCTGAAGAAT 
AACTTTCACG TTGGTTCCCA CACTGATGCC TTTAAAACGA TAGGCAGACA 
CGCCACATTG AAAATGGCAA CGCCGCCTAC TGCCTTTCCC TCCATTGTCA 
CTGAACACTC GCAGAAAAAC CGTAGCCGGT CCGTCCTATG CTGTGCACAA 
ATCTGTGTGC GATTGATTAA CCCGGGCAAA GTGAAACCCT CCCACCAGGA 
CGCGGCACTC CCTGCTGCAT TCGCCTCAAA AAAACTTGCA TCGTCTGCTA 
TGCCAAGCCG AGCATTTCAT ATCTGCCGCC TGTCCTCAGC GAGAGGGCAC 
TCCCTTCTGC TTCTCGGATG TCTTTGCTGC CATTGCAACA ACCGGCGTCA 
AGACACGGCA CGCCGCACCt GCGGTGCGCC GGCACTCGTG CGCGAACTTC 
TCACCCCCTT GCGAAGACCA CGCGCCCAAA GAAGCTGTCC GCGCAAAAAG 
CACAAGGCAG ACATCCCAAT ACGCATCCCC AGAACCCGAA CTGATAGAAA 
CGCGCACCGC AGCACATCCG CACGCGCCAC CGCTTGAACA AGCTCTTCTC 
GTACCACTAG ACTACCTCAC CGCATCCGCT CCGTGCAACC CTTGCG CAC A 
TTTCCCAytA CtGCGCCCCT ACCAGTGCCC CGATTAcGTC AGAGAATGGG 
AcTCCAcGTA cGAAGGGCTA CGGCyTCTTG CAgTGCGCCA CcTGTAAGCG 



CGCACGGCGC 
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TCTATATCTG 
TGCTCCTTTC 
CGCTGCACAT 
GATGACCGCC 
AGTGCGCACG 
GTATGCGAAG 
GTcAGctTGC 
ACCCCAAAAT 
TACGTGCTGC 
As c grTCTCTA 
TGCGCAAGAG 
CTCGCACTCG 
CCTTTGGGAG 
CCAAGTCTGG 
CCCCGTACGC 

GCTGCACCGG 
TGCCAACCGA 
TCTCTCCCAC 
TACCCGCGTC 
TGTCTTCCAC 
CAGGAGCGGA 
CAAGAGCTGC 
CTGCCTCATT 
TGCTGCAAgC 
CATCACACGG 
CTCTGCAGAG 
CGCACTCCCT 
AAAGGGAAGC 



CAaTTGTACG CGCTATTTTT 
CTACTGCACG ATGGAACTCC 
TCTCAGGAGA AAGACGCGCA 
CGTTAATCGG CGCACATAGA 
CAACCGTCTT GcgCAAGCGC 
CAGGcGGCAG CACCTCCACC 
GCCAATACCG CTCCAcCGCC 
TCCCACATGC ACACGGATTG 
TTTTTCCTGC GCGACTGACT 
ATACCGGACG CTTAAATTGC 
AAATTTCTCC CGGAAGACAG 
AGTGCGGCGT GCGACACGGC 
GAGGCCTGCA ATACTGTGTA 
CAGAAGAAGC GCAAACCTAC 
AATAAGATGA TGTCCTCCAG 
CCGCACGTCT TCAAACCCAC 
GCGTTCTTGC CCTGTTCCCC 
AAATAGAAAT GGCGATACAC 
CGCACCCGGA CACCGCTTTG 
CTGGTGCACA GACGCACCTG 
AGAAACCTGC TTCCCCTCCC 
CTGCTCATGC GGCGCATCCT 
ACGAAgTTCT TGCACGGCAA 
TTCCTTTGGC ACGATGTAAT 
GTGCCACGCA CTGGCCGGAT 
CGAACCTCGC GTGcgCATCC 
CTCGCGTGTA CCTGGGCGCG 
TCCTTTTTCA GATCTGCAGG 
GCTGCATTGC GGATAGCAGC 
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GTCCTCCGCG 
CaTCTGTCAA 
GAATCCAATC 
GCCGTTCcCA 
ATGCCGGTTC 
CGCGATCTAA 
AGGTGCATAC 
GCAACTGAAA 
CTGTCTCAAG 
CTAAGAAGAG 
CCCCAATTAT 
GAGCGGGTCC 
CTCAAGTGCC 
CGTCTTGCCA 
CTGTAGTGCC 
CAACGCAGCG 
AGATGCGTCT 
CCCCTCAGGG 
CATAATTGCC 
AAgGTGCACG 
TGAGTTTGAC 
AGCTAGCTGC 
TCCAGGCGTG 
CGCGCTTAAT 
AGCTCAAGCT 
CCTGTCAGTT 
AATGCGATGG 
ATGAGAATGC 
TCCTTCGCTT 



CCCCGATAGC 
CATACACCAA 
TTCCACCGTT 
cTGCGCTTCA 
GGACAGCAGC 
AAGAGGCGCA 
CTTGTGCTGC 
CCGTGCAGGA 
CGGTGTGCGC 
CACTCCCCCA 
TCCTTCTGCG 
TGCTCAGCAC 
GTACGTGCAT 
CACCCAGGCG 
CGGATCAGTT 
TGTGGCCAGT 
CCCGTGGACC 
CGCGCGCCTG 
CCTGCATCCA 
CGATGCGCGG 
TGACCCCACC 
TGGCAGGCAA 
ATACGCGCTT 
CtGCGGCAAC 
CTCCGAGTAC 
CCGGCGTTTC 
GCAGGTCAAA 
GCTCCTGAGG 
CTTTAACCGC 
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AGACCCTGGC 


AGTCCCACAA 


TATCCACTAC 


CGGCAGTCCC 


CTCCGAAGAT < 


CCACTTCCAC 


12180 


CTTTATGACC 


TCGCCTTCAT 
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ATAATCTGCA < 


CCGTTTACCT 


12240 


CCCATGCGAT 


GCGCAGGACG 


CAGTGCGATA 


CACGCGTGTA 


CTGTGCATGG 


AGCTGGGCGC 


12300 


TACACCTGCC 


ACCCTGCCTA 


CGGTAGCCGC 


CGCATACCGC 


ATACCCATAA 


CGATAGCCGA 


12360 


TCAACCTGCA 


TAGCCAACGC 


ACACCCGGAC 


GTACCTACCA 


AGGAACGTAA 


CAAAAAAGGC 


12420 


GAAAAAAACC 


TCAAGTCCTT 


TTCCGTTTTT 


CCGAAAATAG 


GATTGGCCCG 


TTTCTGCTCT 


12480 


CGTGCGGTGC 


AsGGGCCCCT 


GCAGGGCACG 


GACGTCTTGC 


AGGAATGTCA 


GAGATTCCAC 


12540 


GGAGGAATGC 


ATGATTATCA 


ATCACAACAT 


GAGTGCGATG 


TTCGCGCAAC 


GCACACTCGG 


12600 


GCACACCAAT 


GTCCAGGTTG 


GAAAGGGCAT 


CGAGAAGCTT 


TCATCCGGcT 


ACCGCATCAA 


12660 


CCGCGCAGGG 


GATGACGCTT 


CTGGTTTGGC 


TGTCTCAGAA 


AAAATGCGCA 


GCCAAATCCG 


12720 


CGGCCTCAAC 


CAGGCATCCA 


CCAATGCCTC 


AAACGGTGTG 


AACTTCATTC 


AGGTTACCGA 


12780 


AGCCTATCTG 


C AAGAAAC C A 


CCGACATCAT 


GCAGCGTATC 


CGAGAGCTTG 


CAATTCAAGC 


12840 


GGCAAACGGC 


ATCTACTCTG 


CTGAAGACCG 


CATGCAGATC 


CAGGTGGAAG 


TTTCGCAGCT 


12900 


TGTGGCAGAG 


GTAGACCgCA TCGCTAGTTC 


TGCCCAGTTC 


AACGGCATGA 


ACTTGCTCAC 


12960 


GGGCCGCTTC 


TCCCGCACTG 


AAGGTGAGAA 


CGTCATCGGT 


GGCTCCATGT 


GGTTTCACAT 


13020 


CGGCGCTAAC 


ATGGACCAGC 


cATGCGCGTG 


TACATCGGCA 


CTATGACTGC 


GGTGGCGCTG 


13080 


GGCGTACGAA 


ACGGCGTGGA 


TGAGTCAATC 


ATGTCCATTG 


AGACTGCAGA 


CTCGGCCAAC 


13140 


AAGAGCATCG 


GCACCATCGA 


TGCTGCTTTG 


AAGAGAATCA 


ACAAGCAGCG 


TGCGGATCTC 


13200 


GGAGGCTACC 


AGAACCGTAT 


GGAGTACACA 


GTTGTCGGTC 


TTGACATCGC 


TGCGGAGAAC 


13260 


CTGcAGGcAG 


CTGAGTCTCG 


CATCAGGGAC 


GCAAACATCG 


CAAAGCAAAT 


GGTTGAATAC 


13320 


ACTAAGAATC 


AGGTGCTCAC 


CCAGTCTGGC 


ACTGCAATGc 


TTGCGCAGGC 


GAACACCAGC 


13380 


GCGCAGTCGA 


TTCTCTCAAT 


TCtCCGGTAA 


AGcCcTACGc 


CGCGTGCGCT 


CTTGTCCAAA 


13440 


AAGGGCAAGA 


GGAGTACACT 


GGGcCACAGG 


GGCTGCCCTG 


TGGTGCCCTT 


CTAGAATGAT 


13500 


CTTTGAAAAG 


ATTTCTCCcT 


TGCAGGCCTT 


CGTGTGGGCG 


GTTC TGAGGC 


TTTTTCTAAA 


13560 


AAGCTTCAGA 


ACCGTTTTCC 


GTGGCGCGGT 


GCGTGCAGGG 


TGCGGCGTGC 


TCGCCTGCGT 


13620 


CCGTGCATAC 


GGTTTTCCAC 


CCTATGGATC 


AAAGGAATAG 


AGGTGGAAGG 


ACTTCAGGGA 


13680 


GGGTCATATG 


ATTATCAATC 


ACAACATGAG 


TGCGATGTTC 


GCGCAACGCC 


AGGGAGGCAT 


13740 


CAACGGACTT 


GCAATTGCTA 


AGAACATTGA 


AAAGCTTTCG 


TCTGGcTACC 


GCATTAACCG 


13800 


TGCAGGAGAT 


GATGCTTCTG GTTTGGCTGT CTCAGAAAAA ATGCGTAgcC 


AAATCCGCGG 


13860 
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CCTCAACCAG GCAGGGCAAA ATATCCAAAA CGGTATATCC TTCATTCAGG CTACCGAAGa 
TACTTGGCGG AGACAACTGA AATCGTCCAG CGCCTGAGGG AGCTTGCAAT CCAGGCGGCA 
AACGGCATCT ACTCCGCCGA GGATCGCATG CAGATCCAGG TGGAAGTTTC ACAGCTTGTC 
GACGAGGTAG AC CGAATCGC AAGCCAGGCC CAGTTTAACG GCATGAACTT GCTCACGGGC 
CGCTTCTCCC GcGAGTCTGc CCTTGGGCCC aTG c AGCTGC ACGTCGGTGC GAACATGGAC 
CAGAATGAGA AAATATTCAT TAACACCATG ACGGCAAGTG CTCTGGGCTT TTTCTCCGAT 
GAAGGGACAG ACGGCAGTCG TTCCATCAGC ATTGCGACCG TCGACGGGGC GAACAAGGTC 
ATCGGTACGC TTGATAGCGC GCTCAAGGAG ATTAACAAGC AACGTGCGGA TTTGGGTGCC 
TACCAGAATC GATTTGAAAC CGCGTATCaG GGAnATCcTA TCGCGGCGGA AAATCTGCAG 
GCAGCCGAGT CTCGCATCAG GGACGCGGAC CTTGCGCAgC AGATGGTCGA TTACACGAAG 
AACCAGATTC TCGAGCAGTC G AC TATGGC A ATGCTCGCTC AAGCAAATAC ACAGCCACAG 
GCAGTGcTCC GCTTGATGCA GTAAGCGCAT ATCGCGACGG TATATTTTTA AACGTTCAGC 
GTCTTCATTA GAGAGTTTCG GATGTCATCT CTTGTGAACG CCCATGAGGG AAAGAGGAGC 
CGCATGCCTC TTTCCCTTTT CTTTTCTCTT TATCTTTCTG TCATCACACC cTGCGGTTGC 
CATTACGGCG TAAACGCTGG GGATGTACTT TGTATTTGTT AAGTTTGCGT ATCTGTAAGC 
CGACAAGCCG CGCGTGGGGT TTTTTTGAAG GGAACTCGGC TGGAGTCTTG ATCCACTCTC 
CGCAGCTTTC GAGGAGTCAT TGTGTCTGAT GTCCGCATCC CCGGAGTAGG GGCCGGTAAG 
TACGATAACC TCATCCAGTC GCTTATGAAA AAGGAGCGCA TTCCTCGGGA CAACGCTGCG 
GCAAAGGTGA AGGTTTcGAG GTTCAGAACA ACGCGCTCAA GGACGTGGAG CGGTATGCGC 
GCGATTTGCG TGACGCCGTC AAAGGACTCT wTTCCTTCAA cAACCCTTTC GCAGaGAAGG 
AAGCyCATTC TAGCAACGAG CGTGCGTTCA CCGTCGATGC TACTCGAGAC GCTGCCGAGC 
AGAaTCATAC AcTGCGCGTC AAAGACATCG CACAAGGGGA TGCGTTTCTC TCAGACCCCC 
TCCCTGAGGA TTTTCGCGTT CCCAGCGGGA CGTATACGTT CTGTATTGGA GAAAAAAAAA 
TATGCGTGTC GTGGAAAGGC GGGCaCTATC GTGATTTTAT AstGCCGTCA ACAAGCAGGG 
CAAAGACTCA CTCACCCTCT CAGAGATAAA AACGAGCGGT GCGAGCCGTG CGCTCCTGTT 
TCGCTCAGAA TTAACGGGAA AGAGCAGTCG TCTTTCCTTT GAAnGcaCTG CGCTGGACCT 
TGCACTGCGC cTGCGCGTCG TGCAGGAAGC ACGTTCTGAC GTTTTTACAC AGGATGTACT 
CAGTGTTGGA CCTGGAAAAC ACGCGCGTTT GGATTTTCCC CACCCTCTGC GCGCGCAGcA 
GGGcTTACGC TGGAGTTTGT CGCGTCTCTG GAAGGGGCAT CTATTGCAAA CGAAGAGTcG 
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cGTGCGCACA CGCCCGCACA GGGAGGCGCT CCCACGTCTT CCCACGGAAA TACGGCGTCC 
GCTGCACATA ATCAGGACGG AGCAGCTGCT GTGCGCCCTA CTGAACCGGC AAACGGCGCT 
CCTGTACAGG AAGAAACCAG TTCAGTGTTC TTTGAGGGGG TCACAGTAAA GAACGAGGCT 
TCCCAGGGAG ATCTGCCTAC CACGGACGGC TTGGAAAAAT ACCCAGCTGT CGACGACAAA 
GGAGAcAATC CGCGCGCaCC TGGAGAGTCG CAGGGCACGG CCACCCACGA AGGTTCAGGG 
TCGTCCACAG ACAACGCGGA TGACACACGC TCAACTGGTG CCTTGGCAGG ATCGGGTAAG 
CTTGCACTTG AGTCTCTGCA GGGCCACGCG CTTCCTTTAC CACCGCTGGT GCTTACACAG 
AACGCACCGC AGATGGTATC CATTCCTTTG CGCGAGTACG GGGATGTTCG CGCGCTCATA 
CTGGATAACG CGCAGGCGCG AGGCGCACTG ACACTGCGCG CTATCCGTGT GCGTGCCGAG 
GATGCACCAG GTGGTTATGT CCCCGTGAAC CCTGCCTCTC AAGCACAGGA TGCAGCGTTT 
GATTTCGATG GGGTGCACGT TACGCGCGGA ACTAATTCTA TCACCGACCT TATCCCCGGC 
GTTACGCTTT CGCTGCACGA ACGTACAGAA AAAACCGAAA CGCTCTCTGT CACCCCCGAC 
GTGAACGCCA TGAAGAACGC TATTATAGAA TTCGTTGCTA AGTACAATCG ACTCATGGCA 
GAAATTAACA TTGTCACCAG TAACAAGTCA GCCATTATCG ACGAGCTTGC GTATCTTACC 
CCCGAGGAGA AAAAGAAAGA GACAGAACAA CTCGGCAGCC TCCACGGGGA TTCCACGCTT 
CTTATGCTGA AAGACAGACT GAGACGCAAT ACCAGCAATG CGTACCGCGC CGGCGATGAC 
GGTGCATCGC GGACACTTGC ACACATCGGC ATTTCCACAA AAGCGCACGC TTCGTCTGGC 
ATT AAC AC GG CACAGctACG CGGTTATCTT GAAATTGATG AAGAAAAATT ACATTCCAGT 
TTGAACGCAC AAAAGGATCA GGTGCGTGCT CTTTTTGGGC ACGATTCAGA TGGTGAC CTC 
CTTGTGGACA ATGGCGTTGC ATTCACCCTA ACAGAACTGC TCAACCCTTA TTTGGGACGA 
TCGGGTATTT TTGCCATACG GTCAAACGGC GTTGACGAGC GTATTAAATC GACAGAAAAA 
CGCGTAGAAA CGTACGACAA GCAACTGGAA AAGAAGGAAC GGGAGCTGCG ACACAAGTAT 
CACACCATGG ATGGCGCGCT TCGTTCTCTA CAAAAGCAGT CTGACGCAAT TCAGAACTTC 
AACCAGTCTG TTCGCAACAG GAATTAGTGG GAGTCTTAAT GGACATTACG ATTAACGGAC 
AT ACAC TGC A GTATGTCATT GAACATGAAA AAACTATTGG GGAGGTTCTA GGCGCGATAG 
AAGCTGCGTG TAAAAAAGAA AAACAAACGG TATCGGCGGT GACGGTCAAT GGTAGGGAAC 
TGTCTGCTAA TGAATTGGAT ACACTTTTTT GCCAATCCTT GGATACCGAC GTCACCCTTA 
ATCTTACCAC TCTTTCAGGG GGAGACGTGC GTGCACTCTT GCGTGAGATT AGTACCACTC 
TCCTTGCACG CACAGCTGCG TTACAAGAAA TCGCAGTAAA CATGCATAGC GGTAATCTTG 
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CAGAgAGcTA CGCTATGGTC AGTGACTTTT CTGCTCTCTT GAAAAGTCTT TATCACTGCT 
TTACTCTCTC GGACATCGCT GATTTGGATC ATGGC CTG AG AATTAAGGGA AAAGCCCTGC 
ACGATTACCA GCGCGAGATT TCTCCCCTGC TTAAGGGCTT ACTAGAAGCA ATGGAAGAAG 
GGGACAGCGT TGCTGTCGGT GATATTGCGG AGTACGAGTT GGCACCGGTT GTTCGGGATT 
TAAGTGACGG TATCTTGCAT ATGGACATGG GTGTACAATG AAGTTTGACG GACTGATTCG 
CAATCTCGAC CACATTACGC GAAAGGATAC GTATCTCTAC TACCGGGAGG AGTTTTCTGC 
TGTTGCATGT T AC TCTCTC T TCGGTCGAAT TCATTCAGGA AGGGTTGAGT TTTCGGTAGA 
GACCaCTCCC GTTGGGGAAA AGAGCGTGCA GGTAAAATTA GTTGATGCAA TTGATTATCC 
GCTCTTACCG CTTGTACAAG CACTCAAGCG TGTAGTGAGA CTGTTGATCG AGAAGAATCA 
GTTGCCGCGT TAGATCTTGT CCAGTTTTTT TAAAAACGGT AGACTCGCCG CGGTGAGGTG 
CGTCAGTCGC TCAGCGCAGG ATACTGCgCG GtGGGGCACG GTAGTCGGAA GGCTGCTTGA 
GGAAGGTTCC GTAGTAGTCT TGCAGGGGGC GTTAGCGGCA GGGAAAACCT GTTTTGTAAA 
GGGGCTCGCT CTGGGACTCG GTATCCAAGA GGAGATTACG AGTC CTACCT TCACACTGCT 
GGCAGTCTAC CACGGcAGGc tGACGCTCTA TCATATGGAC GTGTACCGGC TCGCTTCCCT 
GGAAGACTTC TTTGATATCG GTGCGCAGGA GTGCGTATAC GGCACGGGAG TCTGTGTCAT 
TGAATGGGGA GAACGGGTCG CGTCAGAACT GCCGGAGTAC ACTGTTACCA TCTCGTTGCG 
TGTGCTCGCA GATGGTAACC GAGAGATTAC CGTAGCGTAs CgCAGAGTGC TTCCTGTCTT 
GCAAAAAGGC AAAGAGGGCG GGGTGTATGA ATATACTTGC CATCAACACC GTTGCGCATG 
CCCTCAACGT TGCAGCTGAA GGAGCACAAG GCACCGCTGT TGTGAGCATC GAAGGTGCGC 
ATTGTTGCAT ACAGCAACAG CTCGTGCGTG CGCTTGACGT TGTCGTAAAA CGCGCAGGAT 
TTCCTGTACA GGAAACACAA ATCGTTGCCT GTCCTCGGGG GCCTGGTTCA TTTACCGGCT 
TGCGTACCGG TTTTGCAGTT GCAAAAGCCC TACAGCTGGG TGTCGGAGCC CGTTTTATTG 
CCGTGCCTAC GCTGCGCcTT GCGGCACATC CGTTCCGCGC G TTC ACAGGA CGGGTGTTGT 
CCATAcTAGa TGCAAAACGT GGTCGTTTTT TTTGGAACTG CTTTAAGTCA GGAGAGCCGC 
TCTTTGAAGA CTCTCACAAC CACGCACAAG AAATCGTAAA AAAAGTGGAC ACACGGGTTC 
CATGCCTGGT GTGCGGCACG GGAACAGCAC TTTTTAAAAG TGTAATGGAA AGCCAGGACA 
ACACGGTTCC TTTCATGTAC GTAGAAACTG ACGCTCATGA AGGAGCAAAG ACACTCCTTG 
CTTTGGTAAA AGTGCTCAAT CACAGCGCCG CCACTCCGGG GGAGCGCGGA GCGCCGCAGT 
ACACAACACG AACTTACGCA AAAGGAAGCT AATACTATGG GCAATTCAGA TATCTGTTCT 
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GACATTAATG ATATCGAAGA ACTTCAATCT GAAGAAGGTG ATGCACCTAT 
GCCAATCCAA TCAGAGAGGA TTACAATTTT ATACGTGAAC AAAACCCCAT 
GGACTTGATC TTATCGGAAG TGCAAAACTG CCCATGCTCT TTTTAGACAG 
ATTGAATATA TCAGCGCCGA AGCGAATTCT CTTTTTAGAG G TT ATT AC C A 
AAGCCGTTCT TTAATGTGTT TGGGAATATC CTCAGCCGTA AGGAACTTGA 
TCTTGTGTCC GATCTCACTC TAAAGGATTT ACCTGGAGAG GCACGATGGC 
CGTGCAAAAA GAGCGCTATA CACGCGCACA AGTTTTATCC CGCTTTCCAT 
CAACCTTCTG GATATATCGT TCTTTTCGAA GACATTTCAG ATATGTACTC 
AGTAATATGC TGAGTAGTTT GCTACAAGCG TCAAAGCTTA AAGACAATGA 
CACTGCGAGC GCGTTAATCA CTATTGCAGA CTCATTGCAG AATACCTGTA 
TTATACCCCC AAGTCGATAC GGACTTTGTA GAGAATATCG CCTTTCTTGC 
GACGTGGGGA AAATTGGTAT TCCCGACTAC GTTTTGAAAA AACGTGGTGG 
TTAGAGTGGG AGCTCATGAA GGAGCATACT ATCAACGGTG CGCTCATTCT 
CCTGACCCTA TGGCGAAGGA AATAGCGCTC AGTCATCACG AGCGCTGGGA 
TACCCCTTCA AATTGGAAGG AGAGATGATA CCGCTTTCTG CACGTATTAC 
GATGTATATG ATGCATTGCG TATGGAACGC TCTTACAAAA AGGGATTTTC 
ACTACACACA TGATTTTAGA ACAGTCTGGa CAAAGCTTTG aCCCCATTTT 
TTTCAGAAAA TACATACAAA GTTCAACGAC GTGTGGGACA gCTACAGGAC 
CAATCCTAGT CAGAGATAAG GTTTTCTTCG GTGTCAAATT GCTGCAAGGA 
GTTTCTGTCT GCATGCGGGA AATGAGAGCA AACAGGTACC GTGCGGTGTA 
GTGTATGCGT GCACGCTCTG TGCATCCCAC AGAGTCCACT CCTGGTGAGC 
AGACGAAACC GTATAATCTT TCCGTCTCCC CGGTGAATGA CAATTGATTC 
GTCGGTGTAG GTCCAACATC AACATAAGAG AAACGTGCGA AAAACTGCTC 
AACACTACAG ACGTCCGACT CCTCACGATG TATTTCCCTG AAACTTCCAT 
TGCGATTGTG TGAAGAGTTC ATGGAACTGC CGCGTATCAC ACCAAAACCG 
CTGAAAAAAG GGGTAAAATC ATCCTCTGTC AAAAAAACAG CTGCGTTATC 
ATATAACGCC CAAGTCCTGT ACCGTTCAAC GCACCAAAGT ATATGTCGCC 
CCATTTTTTT TTGTAAACTT CATCTGCACT GCAGGTTGTT TTCCGAGAGC 
(2) INFORMATION FOR SEQ ID NO: 101: 



ACGAGAAAAT 
TCTCGGCTCA 
CAATCTGCTG 
TCTGGAGAGA 
AGACTTTTTC 
CCATAAAATT 
CAGCGACGCC 
GCAGCAGATC 
AACAGGGTTG 
TGACATCAAC 
AGCTATGCAC 
ATTAAACGAA 
TTCTTCTTAC 
CGGCACAGGA 
GAGCATCGCC 
TCATGAACAA 
GGCACGTGTA 
TGAGCATCCT 
GCTCATACCA 
AGCGCTCAGG 
AGCAGGCTGA 
TTTCCATTGA 
GACCTCTTTA 
CATTTGTATC 
CTTTTCATCT 
TCCAATTCGA 
TATAGCAGCG 
ATAGTCT 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22191 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101: 

AGGAAGGAGA GGATTTTATC GTACGCGGTT TTCTCGATAG CGGATAGATT ACGATAATCT 60 

TGCACGTCAC TGCTCATGTT GATTTCTTCG GGGATCCAGA AGTTGTT c AT TGCCtGCCGA 120 

TACCACTTGC TGACCCAGGG ATACTTCATA TTGTTaAAGT CGTTGAGATT GGTAGTGTTC 180 

CCCCCGACCA TGCGTCGCTT ATGAAGTTCA ATGTCTCCTG CC t C ATT AAA CAGCGCGCGT 240 

CTTTGCAGTA TCGTTGAACT TTCCATCATG ATGAAACCTC CTCCTGAGTG CGGGCACAGG 300 

GTAGTATACT TGATGGACTT TGAATTCTCA ATTGCGGCGG GCGTAAAAAT GTCTGTCTGC 360 

AGGGCTCTGT GTCTTGTGTG GTGGTCCCTG TTTTGTGGGT AGGTTTGGGG AAACCTAACA 420 

GAGACGTAGG AAATCCGTTA TGCTCGAGCA CATGAAGCGT GAACAAGCAC GAAgTCAGCT 480 

ATCACACGAG CCTCCTAAGC GGCGCCGAGC CTCTCTAACC GTCTGCGGCC TGCGTGCTGT 540 

GGAAACGCTT GGCAGCATAC ATCCAGAGAA AATTCATCGG TTCTTTTTTA CCCCTGTGCG 600 

TGCGAAACGC TTCGGACCCC TGTGTGCATA CCTTGCTGCA CGAAAGAGGC TCTACCGTAG 660 

CGCTAATACG CAGGAACTTG AGCGATTGAC GCAATCCGTT CACCATCAGG GGGTTGCTGC 720 

TACCATAGAC GAGCCGCGCT TTCCAGCCGT GACTCATTCT CAGGTTGAAT TTTGGGTACA 7 80 

ACGGCGTGAG TTTGTTGTGT TACTCGATCG CGTAGGAGAT GCCCACAATC TGGGGGCGAT 840 

TATACGTAGT GCTGCTTTTT TTGGAGTGCA CTCACTGGTG GTGAGTGACT GTCGACAGCA 900 

GGCGCAGTTA CAAGCGCAAC ATATCGGGTT GCGCAGGGAG GAATGGAGTT TGTGCAATTG 960 

TTGCGCTGTA CAAATGCGCA GGAAGTATTG GAAATGTGTG CAGGTAAAAT GACCCGTGTG 1020 

GGAGCCTCCC CTCATGCGTT CAGATCGCTT ACACGGCTTT CAAACATACT CTCGCCTGAA 1080 

GAAGCGGTAA TATTAGTACT GGGAAACGAG GAGACAGGGC TTTCTGAGCA TTTGACTGCG 1140 

CATTGCGATC ATCTCTGTCG GATTGCAGGC AGTGGTCAGG TGGAAAGTCT AAATGTTGCG 1200 

CAAGCGGGTG CGCTTTTTTT GTCCACTATC GTACAGTTGC GTCAATCTCC TCAGGACTAC 12 60 

ACGCAGGgAC ATCGGGCCAC GCCACGTGCA CAAGAGCGTG TGCACCGCTG TGGGCAATTA 1320 

GAGGAAAAGG GGCAGAAAAA TGGAGCACGT GTTCTTATTC CCCGCTCGGG GGcGCGTGCC 1380 

AATTCCCGTG AAAGTTGAGA GTAGGGAAAG TAGACGGGTG TGAGGTATGG AACCTACAGC 1440 



Printed from Mimosa 02/03/22 07:28:30 Page: 675 



WO 98/59034 




PCTi 




674 



GCAATTGGTT TTGCAGTGCG TGTTGACGTG CGTCCGCGTG TGCCTCTGAG AACGCAGCCT 
GCTCTGTTGT CGCGTAGGAA AACTTGTAAA GAGCTTGCAC AAAGCGTCTT TTTTTTTTAC 
CCTCGACCCC GTTCGCCTGA ATTCATCACG TAGGAGTTGC CCGCATGAAT CAGATCCGCC 
TGTTTGCCCA GAGTGCGCTT GTGAGCGTCA TGGGTATGGG GATGGTTTTT GCCTTCCTCC 
TTTTGCTCAT ATGCGTTGTG CGCTGTGTGG GCGCGCTTGT CTCTTCTTTC GGCTGGGATC 
GCGGTCCTGA CGAAGGTGTC GGCGCTGCAG TCCCTGCAGG AGGAGCACTC GCCGCGGCTA 
TCGCAGTCGC CGTTCATGAG AAGGCAAGGA GTACTTCATG AGTACCCCGG TTCGCATTAG 
CGAAATGGTC CTACGTGATG CGCATCAGTC TTTGcAC get ACGCGCATGA CTACCGAAGA 
CATGCTCCCT ATTTGTGACA AGCTAGATCG CGTTGGGTAT TGGAGTTTGG AGGCGTGGGG 
AGGCGCCACG TACGACGCCT GCATTCGCTT TCTAAATGAG GATCCCTGGG AGCGTTTGCG 
TGCTCTCAAA TCTC GGTTAC CTAAGACCCC TATTATGATG CTTTTGCGTG GGCAAAACTT 
GCTAGGCTAC CGGCATTACG CGGATGACGT TGTAGATGCG TTCGTAGAGG CCGCTGCACG 
CAACGGCGTT GATGTGTTCC sCATCTTCGA TGCACTTAAT GACCCACGTA ACCTCAGTCC 
AG S TGCGCGT GCTGCAAAGA AAACAGGCAA GCATGTGCAG ATGGCTATCT CTT AC GCT AC 
CACACCCTAT CATACCGCAG AGAAGTACGT AGAGTTAGCA AAGGAGTATG CGCGCTTCGG 
TGCGGATTCT ATTTGCATTA AGGATATGTC GGGGTTGCTG AcCCCGTACG GGGCGTACGA 
TCTGGTTTCT GCCATTAAAA AGAGTGTCGA TTTGCCCGTT GAGTTGCACA CCCACGCCAC 
TACTGGTATG TCTGTTGCAA CCCTGGTGAA GGCGGCAGAA GCAGGTGTTG ATGTAATTGA 
CACTGCCATT GCTTCTATGT CCATGGGTAC TTCCCACAGC C CT ACAG AG A CTTTAGTGGA 
AATCcTACGG CACACGGGCC GTGACTCAGG GCTCGACATA AATCTCCTGC TAGAAATAGC 
AGCCTACTTC CGTCAGtACG GAAG TGCTAT GCCCAGTTTG AGTCTAGTTT TCTGGGTGCA 
GACACGCGTA TCCTCGTGTC CCaGGTGCCT GGGGGTATGC TTTCCAATTT AGAAAACCAG 
TTGCGTGAGC aGGGAGCCCT GGATAAGATG GACCAGGTTC TTAAGGAAAT TCCCCTGGTA 
CAGAAGGACT GCGGTTATAT CCCGCTTGTG ACTCCTACGA GTCAGATTGT AGGTACGCAG 
TCAGTATTGA ACGTGCTGTT TGGCCGGTAC CACCGGCTTA CTGCTGAGAC AAGGCGTCTG 
CTCACGGGTC AGTATGGCCG GACTCCCGCC TCCTGTGATG CAGGTTTGGT GGAGCGGGCC 
TTGAAGGAAG AAAAGTTATC GCAGTCGCTT GTCTGCCGCC CAGCGGATGC CTTGCCTCAT 
GAGCTTGATC GCATGAGGTC TGAGGCcgCs CCGCAGGCGC ACAGGATACC ATTGAGGATG 
TGCTCACGTA TGCTATGTTT CCCAAGATCG CTCCCACATT CTTTGCTTCC CGTGCGCAAG 
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GGCCTATTTC GTTCAGAGGA AAGGGGCAGG GGCAAAAACA GAAGGGTGAG AGTGCAGGGT 3240 

CGGTAGCTTC TTATGTGGCT ACCGTAAATG GTAcTGCGTA CACAGTTGTG CAGGAAGGCG 3300 

CTGTTCTCCG GGTAAATGGT ACTCCCTACA CCGTTAGGgT TGAGGCAGGC CCGTCCGTTG 33 60 

CTTCGGGTAC GTCGCAGGGT ACCGTGACTA CGGCAAAGGT TGGGGCGTGT ACTACGCTAC 3420 

CCGCGCCGGT CGCAGGTAGC GTAtTAAACA CACCGTGCAA GATGGAGCTA CGGTAAATTC 3480 

GGGGGAGACG GTGCTCATGG TGGAGTCCAT GAAGATGGAA CTTGAAGTGA AGGCCACCGC 3540 

TGCTGGTACT ATCCATTTCC TAATAGCGCC TGGCGCGCAT GTCAGTGCGG GGCAAGTCTT 3 600 

AGCAGAGATT CGCTAGAGGA TGTGACCATG AATACGCGTT TACCCCTTCG AGTACTCCAG 3660 

TGCGTGTTGG TGGGATTGCT TGTGTGCGGG CCCCTGTGTG CAGctACGCG CCGCCCGGTa 3720 

CGTGCTTCTG CGCCGGTGCC TATGGTACAG AGTTGTAAAG ACACGGGGGC ACGATGTGCG 3780 

CCGGCGTCGT CCATGCGTGA GGACATGCGT GCGTcACACG GAGcTGCGCC GCTTCTCTCT 3 840 

GTAAGGAAAT TTTTACTCAA TACGTGGCAT AGTACCGGTC TCTACGCTTT CTTTCATGGC 3900 

GTAACACAGG TGCCGGATCT TGCAAATCCG CaGCGAACAC ACAGCGTGTT CGGTTATCAA 3960 

CAGcGTTGCT GCTCGTGGTT GGTCTGCTCc ATCATTTATC TCGGTGCTGC TAAGGGCTTT 4020 

GAGCCGCTGC TGCTCATTCC TATTGGCTTT GGTACTGTCT TCGTCAACAT CCCTGGTGCG 4080 

GGCATGTATA GTGAGCATGG TATGCTCAAA CTCATTTACG ATGCTGGGGT GGGGAATGAG 4140 

TTTTTCCCTA TGCTCATTTT TATGGGTATC GGTGCACTTA CCGATTTTGG AC C ACTGATT 4200 

GCGAATCCTA AAATGGCAGT CCTTGGTGCC GCTGCCCAGT TAGGGGTGTT CCTTACTCTC 4260 

TTTGGGGTTG CAGCGTTGAA CTTTGTACCC GGGATCCGCT ACTCCATCCT GGATGCCTGC 4320 

GCCATTGCCA TTATCGGTGG GGCGGACGGG CCAACTTCCA TCTACGTATC TGCGAAgcTT 4380 

GcTCCCGAAC TCATGGCCGT TATCGCGGTG GCGGCATATT CGTATATGGC TCTTGTACCT 4440 

ATTATTCAGC CTCCGCTTAT GCGCCTGTTA ACTACCAGAA AAGAACGTCT TATTAGGATG 4500 

AAACAGCTGC GTCC TGTTTC GCGGATAGAG AGAGTACTCT TTCCGCTTGT CTTGCTCTTG 4560 

CTCTCGGTGC TGCTCATTCt GCGGCTTCCC CACTCATCGG TATGATCGCt TCGGGAACTT 4620 

TGTTAAGGAA TGCGGTGTTG TGGAGCGGTT GTCTAAGACG ATGGCTAACG AGCTTTTGAA 4680 

CATCGTGTCG ATCTTGCTGT CTTTGGGTGT TGGTTCTCAG ATGACACCCG ATAAGATTAT 4740 

GAACCCCAAT GCCTTGGGCA TTATCGTGTT GGGACTCGTT GCCTTTTCTG TCGCAACCGC 4800 

AGGGGGAGTA TTCATGGCAA AGTTAATGAA TTTGTTTTTG AGCGAGAAAA TTAATCCACT 4860 

TATCGGTTCC GCAGGGTGAG TGCTGTTCCT ATGGCCGCGC GTGTTTCTAA TAAGGTGGGG 4920 
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CTAGAGGAGG ATCCTTCTAA CTTCTTGCTT ATGCACGCGA TGGGTCCTAA CGTGGCTGGT 
GTCATTGGGA CCGCGATACC GCAGGGtGTT CATCTCGGCC TACGGAGGGT AGGGAGGAAG 
AGTAACCGCG GGGTTTTGCC GCTTAGGTAC CCTTTCCTCC GTGCGCGGGC ACACCCTCTC 
AGGTGGCTAG GGGCTTTTGC AGACGAAGCG GGTAAAGCTC GCTTGGAATT CGAGCTCAAC 
TGTCC CAATG GGGCCGTTGC GTTGctTCGC TAAAATGAGT TGAGTCTCGG TTTCGTTGCG 
GTCGCGGTGT AGAAACATGA CCACGTCGGC GTCCTGCTCA ATTGCCCCCG AACCACGAAT 
GTCCGCCAGG TTTGGCGCAG AGCCCTCTGC CGGTCGACCG ACTTGCGAAA GTGCTACGAT 
GGGGATGTCT AGCTCGCGCG CGAGGCTTTT TAGTGATTGG GAGATCGCTG CAAATTGTTC 
GTAGCGGGGC GCAAAGGGAT TGTCTGCTAC GATAAGTCCC AAGTAGTCGA CAAAAATAAT 
CTGGATCTTT TCTTGTACGC ATAATCGACG AGCCACGGCA CGGAGATCCA GTAGCTTCAT 
GTTTGGCACG TCCACGATGT AAAGTGGGGC GTCGTACATC TCTCtGCGGC GTTTTGGATG 
CGCCCGAAAT CGGAAAGTTG TAAAAGCCCT TTGCGCAGGT TCGTCGCGGA TACTCCTGAC 
TCTGCAGCGA TAAGTCGCTG CATCAGAAGC AAATTAGACA TTTCCAGAGA AAAAAAGGCG 
GTTGGAATAC GTTGCCTTAT GGCAATGTTC GAGGCCATAG TCATGGCGAG CGCAGTTTTC 
CCCATGGAAG GACGCGCACC TATGACAATA AGCTCGGAGT TCTGGAATCC ACCGGTAAGA 
TTATCCAGAG CCGTTAGcCG GTGGCAATTC CGACCAGATC GCTTTGATTT CGGTAACGAG 
TCTCAATAGT ATTGACCAAA TCAGGAATGA GGTTTTTCAG CAATTTGAAG GTTGCTACTC 
TCCTTGcATT TGTTAGGTCA TAGATTTCCC TTTGTGCTGT TTCGAGTACG ATGTTGCCCG 
ACACGGTGTC ATTGAATGCC TCTGCGGTGA TAATGCGGGC TACTTTTAGT AGCGACCGGC 
GCATGGCAGC GTCGCAAACG ATGCGTGTGT AGTATTCAAC ATtCGCGGCG CTTGGGACCG 
CATCGgTGAG AGAGGCAACA TACGCGCTGC CACCGACGAA ATCGAGCGCC TCACAGGAGC 
GCAGtGCTCG CTGAGCACGA GGATATCAGG GCGTTGACCT AAATCCGATA AcTCTACGAG 
TGCTTGAAAG ATGCGCTGGT GCGCAgcGGA ATAAAAAGAG CTCGCAGACA ACTGCTCTGT 
TGCCGTGCTC AG AGCAGAG T CATCCAGTAG AACAGCGCCG AGCACAGCCC GCTCGGCCTC 
TAGGTTATGA GGGGGAATTT TTCCCTTGAG TTCCTGAGTG GGATTAGGCA TGCCCGGCAC 
AGAACCTCCT CCGAGGAATA CTCAGAGGAG GGAAGGTGGT GAAAAGACAG TCCCCCCCCT 
CTCCGTGGAA ACCcTCTAAC GAGGCAAAGG GTTGAGGCCC AACTGCCAAG AACTGTCTCC 
CTTCCGTGGA ATCCAGCCCC AAAGTGGCAC AGTTCTTTGC ACAACTAAAA AACGAACAGC 
ACATACTCCG TGCAGAGAGA TACTTCCGCA AACGGTCTAC TCACTCACAC TGTCC gCTTC 
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GCTTTGGTTT TTGATGGTGA CAGGAACAAC AGCACATATT TCCTCGTATA GTCTTATAGT 6720 

GACGTGATAG TTCCCCACAC ATTTCAGAGT AAGACCAGGG ACCTCCACAC GcTTGCGCTC 6780 

AACCTCAAAT CCCATGCACG CAAGTTGTTC TGCAACGGTA TGACTCGTGA CAGCGCCGTA 6840 

CAACTTACCG TTGGTACCGG CGGGCATGGC AATAACCACA GGCTGAGCCT CTAAACGAGC 6900 

CTTAAGATTT GCGGCATCTT GTCGCTTGAG AGACTTTCGC ATCTCTATGT CCTGTTGTCG 6960 

CTGTTTGAAG CGAGCCACGG TAAAACGATT ATGAGGAACA GCAAGGTTTC GAGGGTAGAG 7020 

GTAATTACGA AAATAgcTGC GGCGACCTCT TTCACATCAC CTTCTTCACC AAGGATCTTT 7080 

ACGTCTTGAT TGAGAATAAT CTTCATACGT TTTGTCCCCT CCTACGCAGG AATAAAAGTC 7140 

CACGCAGAGC CACTCTGCCT ATGCACAGCA ACTTCTACTG AACTAGGAAA AACAACCGCT 7200 

GTATACCAGC GCTACTCGGT CAGAACGAAA GGTAGGAGCG CAACGGCGCG GGAACGCTTG 7260 

AC TTCGAGAG CAACACGGCG CTGGTGTTTG GCACACGTAC CGGTGATGCG TCTCGGCAGA 7320 

ATCTTACCCC GCTCTGTGAT AAAGCGACGA AGCGTGTCCG GATCCTTATA ATCAGCTAAA 7380 

AGCTTCTGCG TGCAAAAACG GCATACTTTC TTTCGATAGA ATTGTCTGTT TTTCTTGGGT 7440 

GCGCTCTCTT CACTTTCACG AGGAGAACTG AGATGCGTAT CCAGGTCAAC ACTCGGATGA 7500 

TCTTCTGCCA TGATATGCTC CTAGAAGTGG AAAAATTAGC ACGCCCAATT CCTTTCTGAG 7560 

GTCCAGGCGA ACGCCAGGTT CGGTCATACT TTTATTCATT ATGCATGCTC CTTGTGGGGA 7 620 

AAGAGACCTA AAATGGAACG GTGTCCAGAT CTGAACTTGA AAAATCAGCC TCATCAAGGg 7680 

ACGAGGTTGC GTCCAGTCCT CTTTTCTCAC CCAAGATATC GCCTGTGGGT GAC cTAGAAT 7740 

CTGAGGATGT GCCGCGGACA GACCAGCGGT CGACTCGGCG GCAACGCGC G AGGAAGAAAA 7800 

CTCACCGTCC TCGGCACGAG CAGCGCCACC GAGGACAGAA CCGAGGAGTT GAACGTTAGT 7860 

CGCAGAGATC TCTACCTTGC tGCGTGACTG CCCCTCTTGC TCCCAACGGC TTTGACGCAA 7920 

TTCCCCCTCG ACGGCTACCT GCTTGCCTTT GATAAGATAC TGGCTGATGA CTTCGCCCTG 7980 

GCGTCCCCAT AGAACGATAT CGAAGAAATT AACTTCCTCA ACCCAATCAT CACCACTTTT 8040 

CCTGCGCCGA TTGATAGCAA CAGAAAAACG ACACAGAGCA CCACCTGCAG AAGTGT AC TT 8100 

GAGTCCGCAT CACGCGTAAG CCGACCAACG AGCACTACAT GATTGACGTC TGCCATATCT 8160 

TCTCTCAGGA ATCGACACGC ACAAACAAGT GCGTGAGTAA GTCGTGTCGC AATCTGAGCT 8220 

TATGATCGAG TTCGCGCACC TTCCCCGGCT CACACTGAAC AATGAAGAGC AGGTAACGGC 8280 

CCCTCTTTTG CTTCTTCAGA GGATACGCAA GTTCCCGCTC TCCAATATGG TCTTCGCGGG 8340 

CGATGAcTGC GTCGTTTTCC TGTAGGAGGG CACGAACGGC GGTGGAACCC TGAAGAAAGA 8400 
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GATCCTCGTG TGCACTGAAA ACGGCCATTA GCTCGTAGGT 
ACGGCTCACC TTATCAGGCT GCACTAGACT CACCCGACAA 
ACGCGCGGCG AGaCGCCAGC GTAGTAArAA TCTGTGTGCC 
TTATCCGATG CTGCCGACGG CGCACGTCGT cTGAGGGGGT 
TACGGCAGAC GaGGTGAGGA ACCTGTTCGC AGAAGGTGCG 
GGTCACGTAT GTGTGGGGAG GTcTGTGGTG CAGAAtTTTG 
CACCTAGCAC GTTCTATTGA TGTCCTGTAT GAGGGGCTTC 
CTTTATGCGG AGAAAGTTGT GTATGACGTG CGTTTTTTTG 
TGCGCGCATA TCGATCGCCA GTCTCACTTC CCCGATTACT 
TTCTCCTGCG GTGCACGATT CTtGAACTTA TTGAATTTTC 
GTGACTGCGC AGGTTGAGTT TACCCGGATG CTCCCGTTTA 
TTGCATGAGA ACCTCGCACG CTCCATTCAA GAGGTGGACA 
GTGGTTTCTC AAGATGAGAT AGATGAGTTG CTTGAGCATT 
ACAAGGCGCG ACGTTTTGTG TACGGAGGTT GGCACGGGTG 
AAAAAACCAC GTATTACGTT GTCCGGATGG TCCTATCCGT 
TGAGTCGTTA TTTGCGATTT TGGGTGATAT GCGCGGCTGT 
CGGATCGGGA GTGTGCGGCT TGGAGGCTTA TTCACGCGGG 
AGAGTGGAAT GTGCGTTCTT TTTCTGTTTT GTTGCAGAAC 
TTTGGAATGT AGATGTATGG C AG TGG AGCG GTATATTGCG 
TTTTGTTTAT CTTGATCCAC CTTTTCCCTA TCGCTTTCAC 
TTCTCGTGCG TCATTGTGTA GAGAAGGAAG CGTGGTGATG 
AAAACTTGCG GATAAAATCG ATTCACTTGT GCGGACCGAT 
GGTAGTTGAT TTTTACCGCA GGATAAAGCC GGTCTTGCGT 
GATCTTGCgT TTTTTGTGCG AGGGGATGCA GAGGGTGCTg 
GTTaAAGGTA TACAGCGCCC TGTTTc C ACT CTCTCCGATC 
CAGGCGGGGC AATGCATACC TGAATGAgGG GAAGCTGCAG 
TACCACTGGT TACCACGATG GATTGACCAG GATTGGCGAT 
CGTGTTGaCC GCACTGCGgT TTTATTATTT TGCTCGCAAT 
CGTTTCAGCG CTCTCAGTGT TGATTCGATG TCTTATATAA 



PCT/^^13041 

CCTCATAATA ACTCCTATGG 8460 

GGAGGAGTAG CGTGCGCAGA 852 0 

GCATCAAGTC TTTTGTCCTT 8580 

ACATTCGTGC GTGTGCTCCT 8640 

CTTTCCTGTG TGCCCGCGTC 87 00 

TGTTTCAAGA cAATATTTAT 8760 

AGCTTAACTT AGATGAGTGC 8820 

ACCATGCGTT GCAAAAGTTG 8880 

TACCAATTCT TCATTGCCTA 8940 

TTATTCATCG TGCCTCTCCT 9000 

TTGAGAAAAG ACACAGCGCA 9060 

CGAGCGCAGA TGCGAGTCAC 912 0 

AGGTAATCCG CTTGCTTGTT 9180 

AGGATTACGG GGGGGATGCT 9240 

CCTGCAATGG ATCGCATGCG 9300 

TCCTTTTTGG ATCTTTTCGC 93 60 

GCGTATCCGG TAGTGTTTGT 9420 

GTGCAAGTGG CGCTGTGTCG 9480 

CGTGCACGTA CGCTGTTTCA 9540 

GCTGAGCTGT TGCAGCGGCT 9600 

GTGCACCGAC CAAGAGAGAA 9660 

CAGCGTGTGT ACGGGCGCTC 9720 

AGAACTGAAA CCCGTTGTAG 9780 

AGAGCGACAC GCCGTACTTC 9840 

GCGACCGGGg CGCTGCTAAA 9900 

GAAGCGGCAC GCGTTTTCAT 9960 

GTGTATATGC GCAAGGCGGA 10020 

GAGCAGAAAA TGAGGCCCAT 10080 

TTTTGACGCC GGGGGGGGGT 10140 
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ATGGGGCTTA TGGGGTGCGA GTGTAGTGTT CCGCCTTTTG AAACGGTTGA AACGGGCAAA 
CGGTCAGTGG ATGCGGACAG GATTATGCGT CTGTCCCGGG ATGGcTACGC GTTCTTAAAG 
GTCAATGACC TTGAGAGGGC AGAGAGCGAG TTCGGTAAAA TTCTCCAAAT TGAAGCGGAT 
AACAACTATG CCCTCGTGGG GCTCGGGGAT GCGGCGCGCA AGAGACGCGC GTACCAAGAG 
GCATCTGACT ATTACACGAG GTGCTTACAG CATTACCCTC GCAACAGCTA TGCGCTCTTT 
GGTCTTGCGG ACTGTTATAA AAACATGCGT CGGTACGTGA AGGCAGTGGA AGTGTGGCAG 
CAGTACCTGG AGCAGGATAG C C AC AAC ATT GCGGTGCTTA CGCGCATGGC CGATGCTTAC 
CGTAAAATAC ATGATTTTCA AAACTCGAGA AACCTTTACT CCCAGGTTAT CGCCCTGGAT 
GAACATAATT CCTACGCGCT AATTGGGCTT GCTCACTTGC ACTACGACTT CAAGAAGTAC 
CGTGAAGCAC TCATCTACTG GAAAAAGCTC CTGGAGTGTG CAGAACACAG TGTGGATATC 
CGTGTACTCA CCTCTATCGG GAATTGTTAT CGTAAAATGA AACTCTTTAG TCAGGGATTG 
CCCTATTTTC AGGAAGCGCT GAAGCGTGAC CCAGGTAATT TTTATGGATT TTTTGGGATG 
GCTGACTGCT ACCGCGGCAT GAACATGCAG GAGCGTTCCA TCCAGTACTG GGAGAAGATT 
CTGGAGAAGG ACACGCAGAA TCGTGTCATC CTCACCCGTA TTGCCGACGC ATATCGGCAC 
ATTGGGGAAT ACGAAAAGGC CCATCAAACG TATAAAAGGG CGTTGGATAT CGATTAC GAT 
GCCTATGCCA CGCTCGGGCT CGCAGTCCTT TGTAAACTCC AGGGGAGATA CGAAGAGGCG 
GTTGTGAGTC TTGATCGACT CGTGCAGCTT GATCGGAAAA ACTATCGCGT ATATGTGGAG 
CTTGCAGACT GCTACCGCAA GCTCGGGCAG AAGCAAAAGG CGCTTGAGAC ACTTCGTCCC 
TTTCAGCAGT TTGGGGTTAA GAACCGTGTT GTTTCTGAGC TTATGAGTGA GTTGGAGGGT 
GCATCGTAGT TTCCCCCCTT TTTCTTCCTG GTAAGCTTGG GTATGGGTGG CGTTTGCCAA 
GTGTGAACGG cTCGTTGCGC AGGTGTGGCT GCGCGATGCC TGTCGATTCG GCGCTTTTTG 
TGTGGTGGTC GGGGTCCTCC TCTTCGTCGG AGGTGGTGCA GGACTTTGCG ACGAGGGGGT 
GGAGCGTATA TGGAGTGGTG CTGCGCCCTT TCGGGGCTTT TGCCAGAAGA AATCCAGAAG 
GTGTGTGCGT TTGCTGAGCG CTTTCGTGGG GTGCAGGTGT TCAGATGGAT TGCCGCAGGG 
TGCACTGACT TCCATGCGAT GAGTGATCTC TCTTCTGAGA CGCGTGCACG CCTGGCGAGG 
GCGTGCGTCA TCTCGGACAC TCGTGTCTAT ACCACGCTGC GTGATGTGGA TGGTACGCTC 
AAGCTGGGTA TTGAACTGAA AGATAAACGG CGCGTAGAGG CAGTCTTACT CGTCGATCAA 
GTCTCGCGTA AGACTGCTTG TCTATCCTGT CAAGTCGGCT GCCCTATGGC GTGCGCGTTT 
TGTCAAACAG GCCAGTTGGG TTTCGCGCGA AACCTTTCTG CCTCAGAGAT CGTCGAGCAG 
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TTCCTTCATC TGGAACGATG TGTCGGTACA TTGGATAATG TTGTGTTTAT GGGAATGGGT 
GAGCCCATGC TCAATCTGGA TGCGGTGTGT AGAGC TATTG AGATACTGTC TCATCCACAG 
GGTCGTGACC TATCTGAAAA ACGTATTACT ATTTCTACGT CTGGACATTG CCGTGGTATT 
TATTCGCTTG CTGACCGCGC ACTGCAGGTT CGCTTGGCGG TGTCTTTAAC CACCGCGAAT 
GCACCGTTGC GCGCACGCCT CATGyckCnt GCGCACGACA GTTTAGCAAA ACTGAAAAGC 
GCTATTCGCT ATTTTAACGA GAAGAGTGGA AAGCGTGTGA CACTCGAGCT CGCCCTCATG 
CGGGGAGTGA ATACTTCTGA ACGGCATGCG CAAGAAGTTA TCGATTTTGC ACATGGGCTT 
AACGTGCACG TGAACTTAAT TCCCTGGAAT CCGGTAGCAT CAATCCACTT TGAAACACCT 
CGGGAAGTGG AGGTTGCGCA TTTTGAGG C G CTTCTCATGC GCGCCCGCAT CCCCGTGACA 
CGCCGCTATC AGCGTGGGAA TGGCATTGGA GGCGCATGCG GACAACTAGG TAAAACAGCC 
GGCGTGTAAC TCTTTCGCTC CGTTTGTAGA TGTTTGTACG TGTGGCCATG CTCGTTCCTG 
TTTTTCAGGA AAGTTTCTTT GAGGCAGCCA GCGCTTTTGC CTTGCTCGAG TAGAGCTG T A 
CGCAGTCCCA GGGTAACCAA CCTCTTTCTA CGTTCACCCA CAGCGCTGAC TCTTTATCCT 
TTGAAAGCAA ATCAATGCCG AGCACTGGAA ATATGTCTTT GCGCCGTGCG TACGCAATGA 
CGATCCCCGT GACACCTGGT TTGTCACGAA GCAAAACATA AGCCTTGGTG ATT AC TGCGA 
AGCGACGCGC TTCTCCAATT GGAGAGCTCG AGGGGAACCG TACATCTTCC AGTGAATTTT 
TCCGCACACA GGCGGAAGTG CTGCTGAGTA CGAGCAGTAC CAACGCCCAT GTGCGGCGCT 
GCAGCACGGG GATACCGCGT ACCTTCATAG AAAAAGCGTC GGCGCAAACA GAAGGTGTAG 
AATGCTCGCT GCGAGAGAAA ACAATCCCcA AGGGTTTGGC ATACTGCACC AGCGTGCGCG 
GAATAGGTGA CGGCTTGGAA AGGGCCACAC AATACCGATG TAGAAAAAGA GCACATCCTG 
TACCGCCGGC AAGGGTGGGT ATGAGATCCC CAAGCACGGG AAGTTGGGCA CCGATAGGAT 
TAACGCACTT GCACACTGCG GCAATTCCCG AGAGAAGCGC CACACTAGAG AGAAAGTCGC 
GTCATACAGA AAGGGACAGT CGTGCGCACT CTCCTCTTGG GAGTCGAGCA CCGTGAGGAT 
AAAGCCTATC GCAGCGTTGG TCGCCACAGA TAAAAAGTAA AACGGTAACA TGATGGTCTC 
CTCTTACATG TGGCATGACA TTGCCATATT TGAATATGCA CTCAGGATGT TTGTTCAGTG 
AAACACAGAG CATCTTTACA CATATCCACC ACGATAGTTG AGCCGCCTCT GAATCGGCCA 
CTGAGAATCT CACGCGCTAG GGCATTTTCC AATTCCGTTT GGATTGCACG CTTCAGTGGT 
CGTGCTCCGA AAGTGTCGTC GTATCCGCGC TCCGCAAGAT AGGCTTTCGC CGCGTCACGC 
ACACGAAGTT TTATATGTCG ACTTTCCAAA CGCTCCACTA CCATCTGCAG TTGGATGTCT 
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GTGATGAGGC 
TTTAAGAATT 
TGTTCTTCTG 
CGTGCAGAAA 
AAATCCACCA 
ATATTAAACA 
CTACGTCGTA 
GCGCCAATGA 
AGTGCACGCT 
ACCCCCGTGG 
GCCTTATTAC 
CGTTGCATGA 
GaTACCGGAA 
TCGCGCAAGA 
ATTTTTTTTT 
CCTTCACGTG 
CGTGACTCTT 
TCTAGGAAGC 
CTTTCTTTAA 
TCTAGCTCAA 
AGATCAATCG 
GCTGCAACAA 
AACCGCGCAG 
AGCGGCGTTC 
CGCCGATTGA 
TAGATCCCTC 
CGCCACCGTC 
CGCGGAACTT 
TGAGGCTTTC 
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GAATATGTTT 
CTGGGCGAAA 
TAGATTGTGT 
GAATATGCTC 
CCCTTCCTTG 
CATCCTGGTG 
CCGCTTCTGT 
GTCGGCTGAT 
CATCGTTGAA 
GACCGACACA 
GACGAATCGC 
GTACTGACTC 
TTCCGGTCC A 
GCTGTCCTTC 
CAAGTTCTGG 
AAAACATGGT 
CTATCCTCCC 
CTGCGAGCTC 
GGAGAGAGGC 
CAGGCTGACT 
CCTTATCTGG 
GCGCTTCATC 
GATAGCAATG 
GAGCGCTGCG 
ACGCAATTCA 
ACTTGCGCCG 



TGCACCGGCA 
TGGTACGTCT 



CCGTGTGAGA 
GTATGTGTGC 
CTCGGGTACA 
TGATCCGATA 
GCCGTCAGTC 
CGCTTTCTCT 
CAATTGTCCC 
CGCGTGTTTT 
AAGAAAATCA 
TAAGAAACTG 
GTCGGAAATT 
AAGCTGCAGA 
CATAGAAATA 
TAGACCGGAT 
AATTTTGCCA 
TTCCTCAATG 
TTTCTCATTT 
TTTTTCTAAC 
CTTTTCGATA 
TTCAATTTCC 
TAAAAAGCGG 
TTTGATACGT 
GTGTCCTCCA 
TCCTTTTCGA 
CCGCGCGCAA 
GCGCCTACGA 
ACCGCTTCAA 
ACCAATGCGC 
CCTGAAACGA 



CGCTTAAAAA TTAACACTTC GTCAATCCGG 13680 

AGTAATCCCC GTATCTGCTC TGGTAGAGTT 13740 

GGCAAGTCCG ACGTGTGTGT GCGCGACTCG 13 800 

TTGCTGGTCA TGATGATGAT CGTGTTGCGG 138 60 

AAGCGCCCAT CGTCGAGTAT TTGCAGGAAT 13920 

ACTTCATCAA AAAGAAGTAC GCTGTAGGGT 13980 

CCCTCGTCAT AGCCCACATA CCCCGGGGGC 14040 

TCCATGTATT CACTCATATC GATACGCGTC 14100 

GCTAACGTAC GTGCAAGTTC TGTCTTTCCT 14160 

CCAAGAGGAC GGCGCGTATC AGAAAGTCCT 14220 

ACCCGCACTG CTTCGTCCTG CCCTACCACA 14280 

TATTTCTGTT GCTCGCTTGC CATCATTTTG 14340 

ATTTTCGCAA TGTCCTCTTC ACACACTTCC 14400 

TTTTTCTCTA CTTCTGCAGT AAGAAGCATG 14460 

TACCGAAGTT CTGCAGCCTT GTTCAGGTCC 14520 

CGGAGACGCT CAAGCTCCTC TTTGTAGCGG 14580 

TGCCATTGGA CCTGCATTGC AGCACGGCGC 14640 

TTTTCCAAAC GTTCCTTTGA AGCCGGATCA 14700 

TTCAGCTGTA ATATCTTGCG CTCCACCTGG 14760 

ATTTTCAGGC GGCTTGCTGC TTCATCCACC 14820 

TTGGTGATGT AACGGTCAGA CAAAACGGTT 14880 

ACCCCGTGAT GCAtTCGTAC TTTTCTTGCA 14940 

CCGTAGGCTG TACGCAGTAC ACTTGCTGAA 15000 

TATATTTGCG ATATTCGTTG AGCGTGGTTG 15060 

gcgCAGGTTT CAGAAGGTTC GACGCATCCA 15120 

GCGTGTGTAG TTCATCAATG AATAAAATAA 15180 

TTACCGCTTT TAGTCGTTCT TCAAATTCCC 15240 

CGAGGtCAAG GGAAAGCAAA CGCTTTCCCT 15300 

TACGGCGTGC AAGTCCCTCG ACAATAGCGG 15360 
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TTTTCCCTAC GCCGGGTTCT CCAATAAGCA CTGGGTTATT TTTTGTACGA CGTGAGAGTA 
CCTGCATAAC GCGCCGGATC TCTTCATCAC GTCCAATAAC CGGATCTATT TTTTCTTCTC 
GGGCGAGGGT AGTAAGATCT CGGCAGTATT TCTCCAAGCA CTGGAATGTT GATTCTGGAT 
CCTGGCTCGT AACGCGCTTG CTGCCGCGTA TATCTTTGAG GGCGGCACTG ATAGTTTTAC 
TGGTAATGCC CTGACTGTGA AGGAGACGTG CAGTGTTGCT ATCTGTCTCA CTTATGGCAA 
GCAGGAGATG TTCGCAGGAG ACATATTCAT CTTGGTTCTT GAGCGCGAGG CGTTCTGCAC 
GTGcACAGGC TTTGCTCAGC GTTGGTGCAC AGCGCGTTTG GGCGGCAGGA CCGGTAACAC 
GTGGTTTGCG GCGCAGGCAT TGGAGTAATT CATCGTACAG GAAGTCCGGT TTTGCCCCAA 
TTTTTTCAAT GAGCGGAGAG ATAATCCCGT CTTTCTGGGA AAGTAGGGCG TGGAGTAGAT 
GTTCCTCCTC AACTTGACCG TGGTTCTCCG CTTCTGCCAG AGATATGGCG TCATTGAGCG 
CTTCGCTTGC TTTGACTGTG TACCTGTCTG TGTTCATGGC G TG ATT A TAG GTCTTTTGAA 
CGCTTTTTtC TCGTCATCGG TATGTTTTTT CTACCGCTTG CAGGGGACTT ACGGGAGTAG 
TCGCGGTGGA GAACArGGGT GTACATGGTA TGCGGTGCGC TT t GGCAGGC CGCGGTAAGg 
CGTAgCcTTT TATATTTTCT tGTTTTGAAG TAGGCTCCTG CGAGTTGGGA GGTTGGGAAT 
ATGGAAAAAA CGTTAACGCT TTTTGGGAGC AACCATTGTG GACAATGCGC TACGGCGCTT 
GAGTATTTGC GGAGCAATCA CATTAACTTT GAGTACGTGG ATATCACCGG CAGCGGGAAG 
AACCTCAAGC GTTTTTTAAA GATGCGTGAT TCAATGCCAC TCTTTGATGA CGTGAAGAAG 
GAAGGGCGCA TTGGTATCCC GTGTCTTTCG GTGAACGACG GAGAACAGGT CTTCCTTGGT 
GTGGAAGGTT TGGACCTCTC GGCGTTTCGC TAGGGGTTAG TGGGCCGGTG TGTTCGGGGG 
CGTGATGTCG TCCCCCTCTT TGCGCTCTTT TCGGGCGACC AGCTAGGCGG TTTGCTCTGC 
TTCAAAGCTG CTCTGGAGCG TCTGTGCGGT TGTGCGCAgC TGCTGTTCAG ATTGAATGCG 
GAAGAAGGAG AGGAGGCTGC GTGGGATCTG CTCGTAAGGA GTTTCGCCGT CGAGAAAGCG 
GATGGTCATG TTTGCAAACG CCACGGTGGA GATGAGCGGC TTATATTCCT CAGGAGCGTG 
TGGCAGATTG TGGTGAAAGC GAATGGTGTT GATAACCGGT TCAGGTAGAT TCCAGCGCTG 
TGCGAGCGCC GCGCCAATTT CCGCGTGGCC TACGTCGGAC ATGATGGTGT CGAGCACATG 
CGGGGGTATG TTACGctCCG CCTGAATTTC TGTGAGTTTG ATGAGCATCT CAGGGTATGC 
GGAGGTGAAA ACGACTTCCC CCAGGTTGTG CAGGAGGCCG C AG ATGTAAG AGTCTGCGAT 
GAGCGCCTGG TCGCCAGTGG CTTTGGCAAG ACCGAGCGCA AAGTAGCCGG TGCGGTATGC 
TTGGTTCCAC AGCTGCTTTC GCTCATCGTC GGTAGACTGC AGCACGCGCC CGGCGCCGAC 
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TGAGTACAAC AGATTCTGTA ATCCGCGCAG CCCGACGCGC TTCnAcCgcT TCGCTGATAT 
CCAAACAACG TTTGTTCATA CCGAAGCGCG CGGAGTTGAC GAGTTTGAGA AGGTCGGTTA 
CGAGCGCTAC GTCTTGGCTA ATGAGAGCTA CAATGTCTGA GAGTTGGACG TCAGGATTTT 
CAATGGCGCG TTGAATTTCC AGTAGTTTGC TTGGCAGCTG GGGTATGTCG TCGATGCGGT 
CGGCGATAGA CGCGGCAAGC TTTGTGGTTT GAGTTTGGAT CTCAATGTTG CGGGGAACTA 
CCATGCGCGA AATGGTTCGA TCCTCTTCCA CTACGAGGCG GTACACGTCT TCCTCAAGTC 
CGAGCTTTTT GAGCATGAGC ATCATAATTA CGAGCCCGAG GCCGGCCCCC TCGGAATTGT 
CCAGGATGTG CGCGAGTGCC TCTTCCAAAC CAGAATAACG GCGCGCACGA ACGCGCTTGT 
CAAAAACGCG CTTAAATTCC ATGGGAGTCA TTTTGCAGTT ATTGATCACC TCGATGACAA 
GCGCATGTGA GAGCACCTGC ATGGTCACCT TTACGTACAA CCCCTCCTGT TGTTGGAGCT 
TGAGGTAATG ATTGATGTTC TCAAGGCTTT CTTCCTTGAA GCTTTTCATA CCGCGCTTGT 
AGTCCTGTGG ATCAAAGATG TCCAGTCCTT TTTCGCGGAA GTAAATACGC TTGGTGTTGG 
CCTTTTTCGC GTTTGTGGTG AGTTCACTAA TGCAGTACGT TAAATAATCC TTCACGTGCG 
GCTGGCCAAT GAGATCTAGG ATTGTTTTTG CCACCTGTCC AATATAGATG TCCATATCAC 
GGGGGAGCGT ATAGGTGGTG ATGGCAATCG GGAGCTGCAG TTCAATTGCC TTACGAATTT 
TTTCAGTGTC AACAACTATT TGTGTATCAC ACATGGGCAG ACTGTACACG CTGTGTATGC 
TTTTGACAAC ACGCGCGAAA TGCACAGCTT TCCCTGTGCC AGAATGCAAA CGCGTGTGGG 
TGCGGTGTAT GTCTTTGTTG ACAACGGGGA GAAAAGCGTA CGAGAATCTG CCCCGTTAGG 
TTGAAGGGGG AGCAAACTGT ACGTGCTCAT GTAATCTTTG TTTTCCCGTA TCGATATGGG 
GTGCGATGCG CGCGGTGGAT ACGCTCCTTC TCGACTGAGG TCGCAAGATT TTGTAGGAtG 
CGTAGGGGCG TGCGTCGATT GGTGTGGTGC TATGTGTCGT GCTGGTTTGA TGTATCAGGA 
CTGGGGTGTG AGGGGAAAAA CGGACAGGAT GTGCTTTGCG CGGAGTCGGT TGTTTTCCCG 
TGGTGCGGTG GGCACGGTGT GCTGCACTGT GTTGTTTCTC GCGTGTCGTG TGCGCACTCC 
TTCTTCCGTG CCTCTGCGTT CTGGATCGGT CCGTGCCGCA GTACCCGAGG CCACATCCTT 
TCACTGGCGG CGCTATGCGG gTACGCGCcT GCGCGTGTGT TTTCCGTACC ATGCTTCTTA 
TCGCGCACTC AAAGCGATTG TTCCGGAATT CGAGTCACTG ACGGGTATCG CTGTTGAAAT 
TGACTGGCTT CAGTATGCGC GCATGCACGA TAAGCAGGTA CTCGAACTGA GTAAACCGCG 
TGGTGACTAT GACCTTATCG CTATGCTGTG TACGTGGAAG ACGGAGTACG CGTCGCGCGG 
GgCTACTCCG GTCACTCGAT TCGTTTTTTC AAAATCCTTC CCTGTGTATG CCGCATTACG 
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ATTTTGAAGA TCTCATTCCT GTGTACGTAG AGACAATCGG GTATGTAGGT GGACGCAAAC 
CCTGGCTTGG GGGTCCGGGT GCGTTTTTGT GCGCCGTGCC CTTTGGAGCG GAGACGAGTA 
TACTTGCCTA CCGCAAAGAT ATC TTCC AC A AGTACCGCAT CAAGGTACCA GAAAATTATG 
ATGAGCTTTT GGATGCCTGT AAGAGACTGC GTGAACGCGC GCAAC TAT AC GGTCTTGCAA 
GCCGCGGCGC TTCAGGACAG CAGGCtTGCA CGCATATTTA CTACACGCCg CACCCTTTGG 
TGCTAAGGTG TTGGACGATT CTTTAGTTCC TGCATTCCAT CGTTCGCGTT CTATCGCCAC 
ATTACGGTGG ATGCAAGAAA TGTTCGCGTA CGGTCCACCG GGTATGGCGA GTTTTGCCCA 
AAACGAAGCA TTGCAGGCCT TTTTGCAAGG GCAGACGGGG ATGTACTTGG ATACCAATAT 
GATTGGACCG TTAGTTCGTG ATCCGACACG CTCAGCCATA CGCCCGCACC ATGTGGGATT 
TGCgTTGCAC CCGATGGCGC AGGTGCGTGC AGGAGAAGTT GGCGGCTTCG GGCTTGCAAT 
TCCACACAAT AGTGctGCGC CCGAAGCAGC GTTCTTGCTT TTGCAGTGGA TTACGGCGCC 
GCAAACAGGA CGGCGAGTGG TGGAACAGGG TGCGCTACCA TTCCGCCAGT CGCAGCTTGC 
GGACCGCGCA TTGCGTG C AC GTTTTGCTGA GTTTGAGGTA TTAGAGCGTC AgcTTGcACA 
TTGCGATCCA GATTGGCGTC CTATCGTGCC TACCTGGGGG GAGTTGGGAA CCCTTTTGGG 
AATTGGG ATA AATGAGGTGC TCACCGGTGT GAGTGAGCCA GAGGAAGCGA TGAGCGCATT 
AGTGCTGCCG GCACGTCGTA TTCTCCGTCG TCACGCGCAC GCACGTTATG TGCCGTAAAA 
AGTCTCTTCT TcTTGGGGGa AGAGAGGGCA GAAAAGATGC TGGATATGAG CGGAAACAGA 
ATGACGTGTT CTATCTGTGG AGCGCGTTAC GTTGGGTAGC GTAAGAGGGG GAGTGCCGTG 
GGGAAGAGTA TGAGCAAACC ACGTGATACC GTAAGCGCGT ATTGTTTTTA GCGCTCCAGC 
GCTTGTATTG CTGATGTTCG TGCTGGTGTT GCCGGCCTTT TTGGGGCTCC GTATGArTTT 
TTTTCAGTGG CAACTGaACG CGTTGCGCAC GCAACCTGTA TTTGTGGGTT TTGAAAATTA 
CCGGGAGCTT TTCTCCAGTA TTCACTTTTG GGCCAGTGTA AGAACGACGC TTGTCTTTAC 
GCTCTCAGTG GTGGTGCTTG AGGTTGTACT TGGACTTGCA CTCGCGCTGG TGTTGGAACA 
CGGAGTACCC GGGTTGCGTT TTTTTCGTAC AGTGTTTGTG TTGCCGATGA TGATCGCTCC 
CGTGGTGGTG GGAGTGCTCT GGGGGTTTTT GTATCACCCA CAGTTTGGTA AAATCAACTT 
ACGCTTGCAG GCTTTTGAGC TTGGACCGGT ATTGTGGCTT GCGAATCCGC GTCTTGCGCT 
TTTGTCTGTG ATACTCACTG ATGTGTGGCA ATGGACGCCT TTCGTATTCC TTGTGTTGCT 
TGCAGGTTTG CAGGGTATTC CGCAACATCT TTTGTATGCG GCGAAAGTGG ACGGAGCAAA 
TTATATGCAA ACACTCCTGC ATATAAAGAT TC CAC AT ATT GCGCCTGTGC TTGGCATTGC 
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GACCGTCCTC CGTTTGATAG ATTCTTTCCG TGGTTTGGTA GTGATTATGA CACTGACAAA 
TGGTGGTCCG GGAGTTGCAA CAGAAATCCT GnCCGCTTCA CTTGCAGCGT ATTGCCTTTG 
AGGATCACCG TCTGGGTAAA GCATCGGCAG TTGCTGTGCT TCTGTTTCTC CTGACAAGTC 
TTTTGACTTG TATTTTCATT CTCCTTACGA TGAGGAGACA GGCGCGGTGA GGGTTGCGTA 
CGGGTTAGAT AAGAGCAGGA GCAGATAGCA TGAACATGAT TTTTTTGAAG TGGCGTACCG 
CGTTGGTGTT GTGTCTGTTA AGCTGTATTG CGTTGGTGAG TATGTTCCCT CTCTATGAAA 
TGGTAGCTAC TTCTTTGAAG CGTGATGCGG ACGCATTTCG GTTGCCGCCA GCATGGTTTT 
TTATACCAAC AATTGAAAAC TATCGGCAAC TCTTGCAGGA ACATCATTTT GGACGTGCCC 
TGTATAACAG CTTGGTGGTG ACGTTGAGTT CCACGGTGGT GAGTGTCAGT GCAGGGGCTG 
CAGCAGCGTA TGCAATGCAG CGCTTTCGGT ACCGAGGTAA AAAGGCAATC ACGGTGGCGT 
TGTTGCTCTT GCGAGTGATT CCGCCGGTTG TGCTTGTAAT TCCTATCTTT GTGTGGTGGA 
CTGCGCTCGG GTTAGTGAAT TCTTTAGCAG GACTTGCGcT CGTGTATGGT GCGCTCAATG 
TTCCATTTAA TGTGTGGGTA ATCACTACCT TTGTTGCGGA AATTCCCCCT TCGCTGGATG 
AATCTGCAAA ATTGGATGGA TGTTCTCACT GGATGATTTT TACCCGCATT GTGATGCCAC 
TGATTACACC CGCACTTGCG TGGTGAGTAT TTTTACATTT CGTTTTGCAT GGAATGAGTA 
TATGCTTGGA TTTGCGCTGA CCAATCGGAA AACACGGACA CTGCCGGTGG CACTTTCACT 
TTTTCTCACG GATAGTGGTG TCGAATGGGG GCGGATTACC GCAGCAGCAA CGCgATTGCA 
ATTCCTGCAT GTGTTTTTAC CTTTGCGGCG GCGAAGTACT TGGTGGTGGG TTTGACCGCA 
GkGcGGTAAA GGGATAAACA CTCTGCgCGG GTGAGTACGT GCAGCAGATA TGTGcGGCGC 
ATCCCTGGGA gACTGCgTCG GGTCGTGTGC GCGTGTCAAT GCgTTGTATG TAGGGAGAGA 
TGGGGTGGGT GCaGAAAATG TATGGGGCTG TGGTAGGTCT GCGGTGAGAG AGAGTGCgCA 
CCGGGATGGC aTCTACATCG TTGGCGCAGG ATTCGCAGGG AGTGTCCTTG CCCGTGAGAT 
CCAAACGAAA AAAGTACTCG GCACAGTTAT TGCTTTTTTG GATGACGATC CGTGCAAAAT 
CGGATCGAAT CTTCACGGTG TCCCGGTGCT TGGTCCCATT TTTGAAGTTG CCCGGATTGT 
GCGTATTACT CCGCATGATC ACGCGCTGAT TGCAATTCCT TCTATCTCCA TTGAGCGTTT 
GCGTGACATT TACCTGnACT GCGCGCTGCG GGGTTTACGG TTATCAAACT TCTGCCGGCG 
CTTGCTCAAA TCATCGATGG TACTGCGCAT TTAGTGCAAA CACGTGAAAT T 
(2) INFORMATION FOR SEQ ID NO: 102: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 5420 base pairs 
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<B) TYPE: nucleic acid 

(C) STHANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102: 

ATCTTTTCTG CACAAAAGTT ATAAAAAGAT TTGATAAAGC TTTCGATTTC CTCCATCGGT 60 

TTTGAGCnCG CACGCCAAAT GCGAGAAAAG TCCTTCATCC TTAAAAGnTT GCGCnTTCAC 120 

ATTGGGGAAA AAACGCGGCC ATACGCGGCC GCACATCGGT AAAACGCCAG GGAAACTCTC 180 

CACCAAACAC TTTGGCCAAC TGCGTGCGCA CCACCTGCTC ATCCCATACG AAAACTTCCT 240 

GACCCTCCAG GAAGGCACCA AGACGCGTAT GAGATGGCAG GAGCATACGT GGGACCACCG 300 

TTTCCGAAAG ACGGTCCACC ACCTCCATAT TACACATTTT TACAGCCTcA ATCGTATACG 360 

CCCCaGTCAC CGGACGAACG TGTAGAAGCA CCACAGGCCG CTGATACCAG CCGCTGGTAC 420 

TGAAACGGAG CGCCGTTACC ACCTTCCACG CAGTTGCACA GCCTGAGAGA AACTCGTGCG 480 

GATACCCTGC ACCTGGCACT TTTGCATTAA TGATAATTGC ATCAGGGAGT GTCTCAGGGG 540 

CCGCATGGTG ATCAATAACC AGTACGTCAA TACCTCTGTG ACGCGCATAC GCAATTTCTG 600 

CGCGATTAGA AATACCGCAG TCAACGGTCA CAATGAGAGT ACCCCCTGCG GCCGCATGCT 660 

CATCTACTGC ATTGCACGAG AGCCCATAGG GCTCATCGGC AACCGGCACG CGCCAGCACA 720 

CCGTCAAGCC AAATGCACAG AGAGCCTCAA AAAGAAGGGT GGTGGCGCTA ATACCGTCTG 780 

CATCACGATC ACCAAAGATG AGCACCTTTT CTTTCCGCTC ATGCGCAAGA CGCAAACGGT 840 

CAACTGCAGC GCTCATCGCA TGGAAACGGA AcGGTcCTGG CAGATAGCGT AGATCACGCT 900 

CAAAATGAAA AAGGAGTGCG TCAgcATnCC ACTACCTTGC GACGCACAAG GATGGCCGCC 960 

TCAAGCGCCG AGCAGCTATA CCGCGCGGTA AGCTCGCGGA CACGCCCTTC GTCAAGTGCC 1020 

TTTGCACGCC ACCGTTTCAT GCGTAsCCTT CCTCCGAGAG AGTACCACCC GACGCAAGAG 1080 

GACCCGCCAG AACGGCCACA CGCACCACAC ACGCGCCCCG GTGCcGCGAC GCTTTCCGTC 1140 

* GGCGAAgcaT CtTCCTCTAA AGAGACTGCC CAATCTTTCT TACGGACCGG TATCCAGACT 1200 

CTGCACACCA CGAGCAGGAT CGTCCTGCAC GTCTTCTGAC TCCTCCTCTG GAGCGTCACG 1260 

CGACGGGTCG GATGCAGACG GCACAGGCAC ATGCGGCGCA TCAAAAAGCG GCGCTTCCTG 1320 

CGGATGCACC GGCGCAAAAG GATTGCTCGG GACAAGCACC TCCCCCTGCA CGCCCGGTGA 1380 

ACCCAGTGGC GGATCTTCAA AGACGCGCGG GTCTATATGC AGCCCTTCTT CGsTcACTGC 1440 

CGGCCAGAAC CCAgTGCTGT ATCCAcTTTC CTTCAACCGT TCTTTAGCCA GTTCCTCAAG 1500 
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CATCTTCCCA CTTTCATGGT ACGAACAAAA GCGCTCAgGC AmCGTGcCAr GCAAAAATGA 
CAGAGAAACA AGCTCATCCC CACAGTACGG GGTAGGCAGC AATCCTGATT TTGCACACAC 
CGCCTGGTAC ATCAACCCTG TTTCAGGACG AACAAATGCC CGGTGCGGCT TATCCTGATG 
AATAGCCCGC ATAAAACGCG CCCACGGAGG ACCTGCAAGC GTCGCGCCCG TGCTATGCAA 
TCCGAGCGAT CGGTCTCCTT TGTCAAAGCC AAACCACAAC ACTGCAGTGT AATAAGGAGA 
GTATCCAACC GCCCACGCAT CAGAC CAGTT TTGCGTAGTC CCCGTCTTCC CCGCAACCGG 
CATAACAAAC GATCGCcCCG TTGCAGGGTC TTGGTATGTA AATGCGCGCC CCCGCTCAGA 
GGCCACCGCC AACGTCCCCA TCGTTACCGT TTTCTCTAGC ATATTCGTCA TGAGCGCCGC 
GTTCTCCGCA GAGATCAGTT GCGTTGCCGC ACCcTGCGCG CGCAGgCGGG CCCGCACTTC 
C CGTTCTGG A TCCAAAATCA CCCGCCCTAA ACGATCCTCC ACTGAACGCA CTGCAATCGG 
TTCTACCGCT TTGCCCCCAT TTCCAAACGC TGCAAATGCA CGCGCAAGCT GAATCGGCCG 
GAGGGCAACT ACGCCCAACG CAAGCGGATA GACGCGTGGG AAGGTGCGCT CAATCTCCTG 
CCGATCGGTT ATATGCAGGA GCGTCGCCGC ACGCTGGATT ACCGCGTCGA AACCGACCAT 
ATCCAGTACA CGAATAGCAG GAATATTGAG CGACTGCGCA AGcGCCTTCC ATGCAAGCAC 
TACCCCCTGC CATTTTCCCC CATAGTTGTT GGGAATATAC GAAACACCAT TGCGGCTGaA 
mACCTGCGGT GCATcGTGcA AtGCGTTGCC ATCGTGAGCT TTTTGCTATC CAACGCCGCA 
GAATACACCA GAGGCTTAAA TACACTGCCT GGCTGCAACA ATCCTTGCGT TGCACGAATC 
ACTTGGTTAG AAGCACCGAA TCTGCTGCCC CCTACGAGAG CTGTAATGTA TCCGGTATCG 
TTCTCAAGCG CGATGAGCGC ACCTTCCACC CGCTTACGTG CAATTTCGTC GCGCACCAGA 
GAGGCGCCTT TGTCAGACAA CGTTTTAAGA TTGTCGAGGC CGAACATCAG CGCCATAACA 
TTCACCAGGG GACTGAGCGT ACTGCGGTAA TaCGCGCCGC TTTTTGCCTT CATGCGCCGG 
TCACCGACGT GCAACTGCGG AACGTTGAAC ACCAACCCCA ATAGCTCGCT GATATTGCTG 
TACAGTTCTC TGCGCGCAAC GTGTGTGAGA GAAGATTTTT GCACGTGTGC ATTGGCCTGC 
TCCAGCGTTT GCTCAACTTG CTGCTCTGCA ACCAACTGAT GACGCAAATC GCACGTGGTA 
TGCACGGTAT ACCCGTCCTG GTACAAATTC ATCGTGCCGT ACATCATGCG GTCCAGCTGC 
CTCCGCACAT ACTCGGAAAA CCAACGCGCC TTATCCGCAC GGGCATAGAA CGCAGAACTT 
GTGGTGCGAG TGTAATCGAA ATGCGCCCAG TAGTGCTCGT AGGAC TCATC CCGTTCTTGT 
TCACTGAGAT AGCCAAGgCG CGTCATTTCA TGGAGTACGT AACGCTGACG GTCTTGAGCG 
CGGTTAGGAT ATTCAAAGGG ATTGTAGTGT GCCGGGTTAG AAAGCAAAAT AACCAAGAGC 
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GCCGCCTCTG CTGCGCTCAT CTGACGT AC C 
CCTACGCCGT AGTGCCGCCA CCGAAGTAGA 
TCTTGGAATA ACGCCGCTCC ATATGGAGTG 
GACTGCGGTC GCTGCGGTCT GAATAGAGAA 
CGCCCCCTAA GGCGCGACCG GTGAGGGTGC 
TGTAGCCGTG GTGGGTATAG AAGGAGCGGT 
GTGAAGACAA GTCAGCAAAG GAAACGATTT 
TCAAATCACC CCGAATGTCC AGGATTCTGG 
GTTCGCTCTG CTTAATGTTT TCAATGGAGG 
CCACCAACAG ACCGCACAGC AACACCAGAT 
AGCGGAACAC AAAAAGGCGA AAACTTCAAG 
ACCGCACGAT GCACTCAGGC AGACACACAA 
ACGTACCGAC CGCGCGATAG GTGGGAGGGG 
TCTACCTGCA AAACCTTGAG CCTGGTGCGA 
GGAGGAGCGC GTCG AC AC CT CACCTGTCGG 
TTCCATGAAT CGCTCAGGAT TTTGGGGTAC 
TGGCCCGGTC GATGCGCCTG TCGATCCCAC 
TTGCTGTACA CGGACGCGCA CTGCACTCAA 
GTGCTGCAAA ATCACGTACT TGCCATACAG 
GCTCGCACAC GCATACACCA GCGCGCCCAT 
ACTCAGCCgC CCGGTGAACG GGCTTTTGCG 
TGCCAACGGA AAACGATAAA ACGGCTTAAG 
GGCCTGAGGC GCGCACACTA CCTCCCGCTT 
CGGAAGGGAA AGAAAAAAAG ACGGCCCTTG 
CTCCAAGGGA AGGTGCGGAT CCGCAGAAAC 
CGTACGCCCC TCAAGGGGGG TATGCAGCGT 
ATAGGGAATA CCGCAGCGCG CGGCGATACG 
TGCATAAAAA CGCAAGGGTA AATCGTTTCC 
aGCAcGtCAT CGCTATACTG ACGAAAGAGC 



YCVrm ■B3041 

688 

GAATGGCCAA AGTAGAAGCG GGCAGCCGCT 33 00 

CGCGGTTCAA ATACAACTCC ATAATTTCGT 33 60 

CCCACCACAA CTCTTTGATC TTACGCCTGA 3420 

GACCTGCTAT CTGCTGGGTC AGCGTACTCC 3480 

CGACAAGGGC ACGGAAAATA GCCTTGATGC 3540 

CTTCGCGGGT GAGTAGAGCG TGCACAAGGT 3600 

CGCGCTTTTC GTCTGAGGAA AACTCAGTGA 3660 

TGGGAAGCGC CGGATTAAAG CGGGTGAACC 3720 

CAAGCAAGAA CCCAAAGAGC GCAGcTCCCC 3780 

ACAGGTAACA AACGCGACGC ATGGGCCAAG 3840 

GCAAGAGGCA GAAGACCGTC AACGGCGCAC 3900 

AAAGGCCGGC TTTCTCAAAC CGGCCCCACT 3960 

AACCACGCAG CCGGCGCCCC CAGTATCGGT 4020 

ACCCCACGCC CAACGGTAGT GCCACAGAGA 4080 

CGCCCCGACC GTCACGTATC AGCGGAACTT 4140 

CCCCGCTTCC CGTACCTCAA AATGCAAGTG 4200 

ATTGCCGATC ACCGCTCCCA CACTCAGTTT 4260 

ATGTCCGTAT AAGCTGTGCC TTCCGTCTGT 4320 

ACGATTGTAC GCAATCGTCG CCACCTGTCC 43 80 

GGGAGCGGCA AGATCTATAC CTGGATGATA 4440 

TGCGCCAAAC CCTGAGGTGA GTCTTCCGGA 4500 

AAAGAAGGCA CGCACCGTTC CGTCAAACAA 4560 

TTCTCGCGTA TGcgTTGTCT GCGCAGTCCC 4620 

ACCCTTTTTT ATCAGTGCGT AAATGAGCCG 4680 

GTACAGTCCC GGAACGnTTG GCAAAAGCAA 4740 

CTCAATACGG TTCAAACTAG CCACCGCGTC 4800 

GATGATGGTG TCTGCCTTTT TTACGCGATA 4860 

CCGCTTGCCT TGCGCCAATG CCaCGCgCGC 4920 

GCATCCTGCC CTTGCAGcTG CGCTATCAAC 4980 
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GGATAGGGAC CGCACACCGC AGGATGTGCC ACACACACAC ATAGGCAGGA GCACACgcTG 
CGCGCACCAT CCGCACGCGC AGGAGAAAGG AGAACGGAGA AAATGGGCAC AGACGCATCA 
CACGCAGCCG CTTAGCACTC CTGCGCGGGG CGAACTTCCG CGGAGGAAGA GAGCCC CAT A 
GCACCGACTT CCCCTGCTTG CACTAAAACG CATGGCTCCT GCGCGGCGGC CCTACGcCCT 
GCACGTCGCA CCCGTGCATA AGAAAGArCG TTGCACACGC GTcTCAGTCC CCACAGGACC 
GCACCGCCGC CATAGAGCAA ACCGCTCAAC ACCGAGAAAA CACGCGGACG ATGCACCGCC 
AACGCCCAGA GAGGGTAGAC GAGCGCGCAG CCGAGCGCAA ACGAAAGTAT CAGCAAGCAT 
CGCAGCGAGT GCTCCAATGC 

(2) INFORMATION FOR SEQ ID NO: 103: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6754 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
<D> TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103: 
GGGATCAGAT TGAAAATGCA GAATATTTTA GCGCACACGG TGCTGCGTGC ATCCTTCGTG 
CTCAAGATGA AAAGGGACAT CAACTGGTGT CTTTGCTCAC AGAACTGTTC CATCCTTCCT 
GCGCTAGGAT AGAGGAGATG GCACGTGCGT C TTACAC AC T TGGAATTGGC AATGCCGCGT 
ATGATATTGC GCAGCAGTTG CAGACTTTCA TAAAGGAGGG GATGTGACAA TCAGTACCCT 
GGATCTCATT CTTGGAATCA TCATGGGGAT AGTGACCGTC CGTGCCACCA TGCGCGGGTT 
TGTCGATGAG TTCTTTTCTA AGGCAAGTAT CCTGTGCGCA gCAGTTGTTG CAATACTGTG 
TCATAAAAGG CTCGTGCCCT TGACACGTGT GTTGTTAGGC CACAGTATTC TGCTTCCGTG 
TATAACGTTC TTGATTACCT TTATGGGCGT CTATTGCGTT ATGCTCTTCC TCCGTTCACG 
TATGCGCACG TaTGctACGC GCGATCTTAT CAGTGGTTTT AATCAGGTGT TTGGCTTTTT 
TTTCGGGATA ATTGAAGGGA GTGTACTACT CACTGTTATC CTTTTGCTTT TACACGTGCA 
GCCTTTTGTA TCTGTTTCGC ATATGTTGCA TGAAAGCGTA ATTAACACTG TTCTCTCTCC 
CC TTGTCTTA GATGGCGTTC GCTATATGCG CCTGAAGATG TAGGCGC GTT TTCGTGTTCG 
AGAACATTGT CGGACAGCCA GCTACTGACC TTCTGCGCGA TGATGTTATG CACGCGCGTC 
TTCCTGCTGC GCTCTTATTT GTTGGACCAC CCGCTAGTGG CAAGCTAACT GCTGCACTTG 
AGcTTGCGCG CGTGCTTTCC TGTACGCAGG GGGGCGTGTG GCAGTGTCCT TGTGCACCCT 
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GTGTGCAGCA TACGGAGTTA CTTTCTCCAG AACTACTCGT GATGGGGATA AAAGATCTGA 
TCCTCGAAAT ACGCGCGTCG GCACGTGCCT ATATGTCAGT GCACACGCAG GGTACGCGCT 
ATCTTTTTGT GCGGGCAGTG CGTAAACTCA TTACACGCTT TGACGAACGA CTGTGGGATA 
GTGATGATAC TCGTTTCTCT GCCGCGGTTT CGAGCATTGC TGAGCTCGAC CAGGAGCTCG 
CATCACTCCC CGcACAGGGA GACCGACGTA CCACTCCAGA GCAAAAAGAA AGAGTGCAAA 
GGATATGCGT GATCGCCGAA AGGCTAcAGC AGGAGTCACT CTATACCCAA TTACCCGTAC 
AGCAAATACG AAACGCAATT CAGTGGGTGC GTCTTACACC CTCAGGAAGA AAGAAAGTGC 
TGATCATAGA ACACGCGCAC GCGATGCATG AATCTGCACG AACTGCTTTT CTAAAGATAT 
TAGAAGAACC TCCGC GCGAT ACGCTTTTTA TCCTCACTAC CGCTACAAAG TATGCAATCA 
TGCCTACTGT CCTTTCGCGT GTGCGCAGTT ATTCTTTCAG AGAAAGAAGT GTTGAACTCC 
AATGTGCAGT AATTACACGC GTATTTCATG ACAGACCGAC CGATGCAAAA AACACACAGG 
GTGTACTCTT GCACTATTTA TATCAGTTTT TGTCGTTTTC TTTAGAAGAG GTGCAAACGT 
CCGTGATGTA CTTTTGGCTG TATGTGTGCC AGCACGCTAG GTGTATCGGA CGCATCATTC 
CTGATACTTG GGTTTCGGTA GGTACTCAAC CTCAAGTATC AGGAAT sG AT CTTACCTCAC 
TTGATTCTAC GCTGCATTTT TTCCAAGCTC ACAAACAGCA ACACGCCGTG TCCCTTTTTT 
TCTCCTTACT CGTCAGGCAC ATGCGCACCC TGCAGCGCAC CACCGAGTAC TCCGCACGGA 
ATACAGAATG t TCGCACAC A TTGCTCACTG TATCGAACAG GCACATCGCA ATGTGCAGTT 
GTGGAATCTA ACAATACAAG GGACTCTCGA ACACTTAGCA CACACCATCG CGAATCATCT 
ATGAGAGATT TTATTGCACG CGCGCTAAAA AAGTCAGCGA AAATGAATGA CTCTCAACTG 
AGAAATATGA TTGAACTTAT TGCCAACGAG TACACCTTGT TGGATGCACT TATGGATTCT 
CTGAATTAcG GACTTATCGT GTTGGACTGT TTACACATTC CATTAAAGAC AAACCGAGCA 
ATTGCACGGC TCTTGGGTAA ACCACTGCCT TCAAATCCTC GCAGGCCACT GTGGCATTAT 
CTTGATGACG AACACATTGC GCAGTTCATT GTGGCAATTA TTAAAAATGA GGTAGGAAAA 
GCACGCGCAG AATTCATTGT ACAAAGACAA GGTGAAACAT TGTATCTGGA AGTATCCTTA 
TTTCCGCTAA TTTGTGACCA AAAGATCCGC GGAAGTATTA TCGCAATACA TGATATCACA 
GAGAAAAAAC AAGAAGAAAT CTATAACCGA AGCTAGAAAG TCTTGCAAAT TTAACGAATC 
TTGCAGCAAC CGTTGCGCAC GAAATCAAGA ATCCCCTAGG AGCAATGAGC ATTCATCTGC 
AATTACTACG TAAGAATTTT AGTACCTGTA GTTTCGAAAC AAATAAAAGA ATCCAAAAAC 
ACCTCCATGT GGTAGAGGAG GAAATCGAAC GGCTCAATAG AATTGTCACC GGCTTCCTTT 
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CTGCAGTTCG TCCCTTAAAA CTAAATATCA CACGGCTGAG CGTTTTTGAT CTTGTTACAT 
CCATACGAGA CACATTTATG AAGCCTTCAC CAAAGCAGAA CTGTCTTTCT CTGTACATAT 
GCCACACAAT CTTCCCCACA TACGAGGCGA TGAACACCTG CTAAGACAGG CATTGGTAAA 
CATTATCACT AATGCTAAAG AAGCCATGCA AAGAGGAGGG GCCCTTGAAG TCTTTGTCCA 
TAAACAAACT GACCACATCA GTATCAGTAT TTCGGATACA GGAGAGGGAA TTGATGCCCG 
AAATATTCAC AATATTTTTG AGCCGTACTT CACTACTAAA ACTGAAGGTA CGGGGTTAGG 
GTTAACCTTA ACGTTTAAGG TGATTAAAGA ACATGGCGGT GACATCAGTG TGTCCTCTAC 
TGTTGGACGG GGTACGTGTT TTACTCTCCT TTTACCCATA GATAAATTGG GACGATCGCT 
TTTACAAGAA AAAATATCCA CCCACCTAAG ACATACGAGT AAAGAATAAG GAAATGCGAT 
GAAATTCAGT ATTCTCGTAC TAGATGATGA AAAAAATATC CGTGAAGGTT TGCAAATGGC 
CCTCGAAGAT GAAGGATATG AGGTGTTTAC CGCAGAGGAT GGAAATACAG GGGTAGAGAT 
TGCCCTCAAA GGGGATATCG ATCtTATTAT CACTGATTTA AAAAtGCCtC GTATGAGCGG 
GGaATTGGTG CTGCaACATG TGCACGCGGT GTTGCCCGAT ATTCCTAtCA TTATTCTCAC 
CGGGCATGGC AC AG T AGAAA ATGCAGTTGA AGCAATGCAC AAGGGAGCTT ACGATTTTTT 
AACTAAACCA TTGGATCTTA ACCGATTGTC TTTGCTTGTG CGCCGGGCGC TACAAAACCG 
AGAGTTGATC GTTCAACATC GAGAGTTAAT CAAACAAATA GGAAATCGCA CCTCATTCGA 
GAACATTGTA GGAGAAAGTC CTGCAATGAA CAAAGTGTTT GACATGGTAA AAAAGGCAGC 
CGCCTCAAAA GCGTCCGTGC TCATTACTGG AGAAAGCGGG GTCGGTAAAG AACTTATCGC 
GAATGCAATC CATAATCTTT CGCCGAGGAA GGCAAAACCT TTAATTAAAG TACACTGCGC 
TTCTTTTGCA GAAGGAGTGT TGGAAAGTGA GTTATTCGGT CATGAAAGGG GTGCCTTTAC 
CGGTGCGGTC AATCGCATGA AAGGTCGTTT TGAACTTGCG CACGAAGGAT CAATGTTTCT 
TGATGAAATC GGAGAAGTAA GTATGGCTGT GCAAATAAAA CTACTCCGTG TGTTACAAGA 
ACGTTCATTT GAACGTGTAG GTGGAAGAGA AACAATAAAA GTTGATGTAC GCGTAATTTC 
TGCAACAAAT CGTAATCTTT TAGAAGAAAT TAAACGCAAT TTGTTTCGAG AGGATCTTTA 
TTACC GATTA AATGTTGTGC ACATTCACGT tCCTGnCTGC GCGAGCGcAA GGAGGATTTG 
CCATTACTGA TTGCAACATT TCTTAAAGAG ATTGCAGAAG AAAACGGTAA AAAAATTACC 
TCTATAGATC CTCAGGCCCA GTCTGCACTG CACGCGTATG ATTGGCCTGG TAATATTCGT 
CAGCTGAGAA ACTGCATTGA AAGCGCTGTC ATTATGAGCT CAGGTCCTGT TATCCACATA 
GAGGATCTCT CAGAGCCAAT TCGATCTCTC GGTGAAACCT CTTCCATACG CaTTCCTATA 
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GGAGTGAGCa TGGAGGATGC a GAAAa GGAA aTCATCCTCC AGACACTGGA AGCACAAAAA 4440 

GGTAATAAGA GCAAAACCGC AGACGTGCTT GGCATTGGGA GAAAGACGCT CTATCTAAAA 4500 

TTAGATCAAT ACACGAATAC AAGCTTTGAA CCTGATGCCG CAGCAAAATC ATGaAACGTG 4560 

cTTTGATAAT CACCGGAGGT GAATATGCAC CCTATGAGTT TGTGCAATAT TACCTGCCTG 4 620 

CGTACGATCT GCTCATTGCC GCTGATTCAG GGCTTGATAC CGCATTGCAA TTTGGTCTTG 4680 

TGCCCGATTT TGTTATTGGA GATATGGATA GCGTTAAGGA CAACCTGTTC ATACAGGCGT 4740 

GTGATAAAAC GCGCACACAC CTTTTCCCCC GAGATAAAGA TTTTACTGAT ACTGAGCTTG 4800 

CAGTCACCCT TGcGCACCAA TTGGGAAGCG ACGATTTGAG CATCGTCGGA GGGGGTGGGG 4860 

GAAGGGCAGA TCACTTTTTA TATTTCATGC GTCTTTTTGC CGCACCTCTG TCACCGCGTC 4920 

TGTGGCTGTA CAGACATGGA CTGGGATATT GCTTTGGGGA AGGATGTGTT ACACAACAGT 4980 

TATGTATTGG AGGAGTGGAT AATACTTCTT TTTCTTTCTT TCCCGTTGGA GATGCTACAG 5040 

ACTATTCGCT CTCCTCTGAA GGATTGCATT GGCCCCTCGA TGGGgTGCCg TGGCACACTC 5100 

ATGTAAGTAT GAGTAATCGC AGCAGCGCAC CTGTCGTGCG CGTCGAAGCA CACCGGGGGA 5160 

GATTTTTGCT TTTCCTTTCT CCCCTCGGAC GTTACACCAT TGATCATCAC GAGCGGGGTA 5220 

TTGCGTGCAC GCACAGAACG TAGATATTGC GCCGGGCAGT ACCTCGACCG TTTCCATCAT 5280 

AGTGGGTATT GACCCAGGAC TTGAATCTAC CGGATACGGC GTTATAGAAG CAGGGGGAGG 5340 

CAGTCTGCGC TGTCTTACTA CGGGGTGATT GTTACCCAAA GCAATCAGCC ATCTGCTGCA 5400 

CGACTCAGAC ACATCTTCGA TACCCTGCAA CAGGTAATCT CAATATATCA ACCTCAGTAT 54 60 

TGCGCAGTGG AGACAATCTA TTTCGCAAAG AATGTAACCA GTGCGTTGTG TGTTGCGCAA .5520 

GCGCGTGGGG TTGTATTACT TGCTATGGCA CAACAGCACA TTTCAGTAGC TGAATACGCA 5580 

CCGAATGCGA TTAAAAAAGC AATAACTGGT ATTGCCCAAG CAGAAAAAAG ACAGGTACAG 5640 

CATTTGGTAA AAATTTT AC T CAATCTTAAG GATAT AC CTC ATCCTGATCA CGCTGCTGAT 5700 

GCCCTAGCGG TTGC TGTTAC CCATGTACAC TGTTGTATGT CTTCAAACTA TGCGGTAGGT 57 60 

TCAACGCGCT CTAGGGGAGC GTACGTTACG CTGTACAAAA AAGGTAAGAG ATGAAAAGCA 5820 

AGAGTTCTTT GTTGAAAAGT GGGTTGCTGC TTTCTCTTTT AACACTTGTC TCTCGTGTAT 5880 

TGGGTTTAGC GCGAGAAGTA GTGAAGTCTA CGCTTATGGG GACCAGTGCG ACAGCAGATG 5940 

CATTTACCGT TGCATTTATG ATCCCAAACC TTTTCCGCCG ACTGTTTGCA GAAAACGCCA 6000 

TAAGTGTTGC CTTCATTCCC GTCTTCACAC AGCACTACTC AATGCCGAGT TCAGCGCAAG 6060 

TGCCATGTTC TTCTAAAACG AAGGAGTTTC TTTCAGCTAT CTTCACACTG ATGAGTAGTG 6120 
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TCACTGCAAG CATTTCTCTT ATCGGTATAC TCGGTGCTCC GTACATCGTG CGATTATTTG 
ACACTGATCA GTCATTAACC GTTTCATTAA CCCGCTTGAT GTTTCCCTAT TTATGGATGA 
TCTCTCTCGC AGCTTTCTTT CAAGGTATGC TGCACAGTAT TAAGGTATTT GTCCCCTCAG 
GATGTACCCC AATATTTTTT AATGTCAGTG TCATTTTTTC GATGTACTTT CTGAATGTGT 
CACATATGAA CGTGGCTATT GCTGCAGCAA TAGGTGTTCT TATAGGAGGA TGTGCGCAAG 
CACTCTTCCA GCTAATATTT GTATATATGC ATGGGTTTCG TTTTACGCTC CAGTCTCCTT 
TAAAAGCAAT GCACGATGAA GGTGTGCGAC GAATCATTGC GTTACTTCTA CCGACAACTG 
TTGGCATTGC AACCTATCTT CTAAATGACC TGGTGTGTAC TGCGCTTGCA ACCTCTGTTG 
AGATAGGAGT TGCTGCGAGT GTGCAATATT CAtTCGTATA CAAGAACTTT TATTAGGrAT 
ATTtATCGkT TCTCyAAGCT CyGtGGkACT TCCTGAtCyT TCyTTCCaTG t TATGAGAAA 
AGATTGGCAA TCGTTTGAGG ACCTCCTGAT AACA 
(2) INFORMATION FOR SEQ ID NO: 104: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9410 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 104: 
GCCAACGCGT TGGCGGCCCT ATGTGCATTT CCGTTCTGCT TGCGTGGTAG TGGATTGTGG 
AACAGCGCTC ACCTTTACGG CGGTGGATGG CACGGGGTTG ATTtCAAGGG GTGGCAATTG 
CGCCTGGTCt GCgcACTGCG GTGCAGTCTC TC CAT AC AGG AACGGCACAA TTACCACTTG 
TTCCTCTTGC CCTGCCtGAT TCCGTTCTGG GCAAGGATAC TACGCATGCG GTGcAGGCGG 
GTGTGGTGCG GGGCACGCTC TTTGTTATTC GCGCTATGAT TGCACAGTGT CAGAAAGAGT 
TAGGGTGmCG CTGTGCAGCG GTGATAACGG GGGGGCTTTC GCGTCTTTTC TCGTCAGAGG 
TGGACTTTCC TCCTmTCGAT GCACAGtGAC GCTCTCAGGT CTTGCACATA TTGCGCGGCT 
GGTGCCGACA TCTCTCCTGC CACCTGCTAC AGTGyCaGGT TCATCGGGGA ATTGAGGAAA 
CTGTTATCCG CGCTCCCCAT CTTCCGATAC TGGATCGGTG TCGGGGGGAG TAGGAGTGGG 
GAAGCGTCTG TGCTGTATCG CGCTGGTGAT GCGCGCGTTC TGGTACCTCa kTsCGAAGGG 
AgTCAGTaTc GCTTACGTGC CCgTTCaTcG CAGTGGGGGC TCTCAAGATT cGAGCATGAG 
CACAGCAGTG GGCGATACGC TCyTTAACGC CTTyTTCGAC GAGGrAATGG TGGTTACGgC 
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AGTACCGCCG GGTGTACACG ACGGCCAGAC TATAGCAGAA ATTGCTGCAT GTTTTGAAGT 
AATGCCCGAT TACGCGTTGT TGGTGCAGTT TCATTCCGCT CGTCTCCCTG GTGGGGAAAG 
CCCTACCTCC CGTGCCCGCG GCGCTTGGTC TTCAGAGAGG * -CCGTGCTG TGTGGACATT 
AGTGGATTTG CATACGCAGC GCGCGTGTGT CTATGCGTGT GTCGCCCCAT ACAGGGAGAG 
TATTCCCGTT TCTGAGTGTG TTGACGTCGT TACCCGTTGT ATTGCGGAgC AGGcAATTTC 
GTACATACGG GTGGGCACGA GCACCGATAC AGCCGGAGTT CAGTTATAGA AAATAGGGAA 
TACGTAAGGT GTCTGCAGCG TCGCTTCAGC TGGGAGGAGT CTTATGATTA AACGCCACAT 
GTTCGCAAAA AGGGGTGTCA AAGGAAGATC TTACCTGGTT AGGGTGAAna inTGCGTTCTT 
AGTGCTTTGT GTTGCTTCTG TCACGCCGCT TTGGGCTGTG TGGGAAGGGA ATGCAGAAAT 
TGGCCCCCAG GGAAGTTTTC TGCAGGACGG CATGTTTGTG CGCAGTGACA TGTTCCCCAA 
AAACACTGCT GTTGAAATTA GCAACTTAGA AAAGAATGCC AAGGCTCAGG CAGTGGTTAT 
TGGGCACGCA GGGATCCCCG GTCTTCTAGT TAGCCTTGCA CCCGCTGCTG CAGCACAGCT 
TGGGATTGGC GTATACCAAG CTGTGCGTGT ACGCGTACGT ACCTTGGGTA CCGTGCGCGG 
TGGGTCTCAA ACAAGTCAGG ACGGACTGTC CCTTGCATCT TTGCCGTCCC GTGTGCCTGC 
GCGCCCCGCC AGcgTGATCC TCTGTCATCC CCGCCGGCAG GTCACACTGT ACCGGAATAT 
CGCGATACGG TTATTTTCGA TGACCCGCGT TTGGTTTCCC CTTTGTCTCG TGAGGTGGAG 
GACGCGCCGA AG t AGTGGAG CCGGCCTCTG AGCGTGAGGG AGGGGAGCGT GAGGTGGAGG 
ACGCGCCGAA GtAGTGGAGC CGGCCTCTGA GCGTGAGGGA GGGGAGCGTG AGGTGGAGGA 
CtGCCGAAGg T t AG TGGAGC CGGCCTCTGA GCGTGAGGGA GGGGAGCGTG AAGgTGGAGG 
ACTGCCGAAG GTAGTGGAGC CGGCCTCTGc AGCcGTGAGG GcAGGGGAGC CGTGAGGTGG 
AGGACGTgCC GGGGGTAGTG GAGCCGGCCT CTGGGCATGA AGGAGGGGAG CgTGAGGTGG 
AGGACGTGCC GGGGGTAGTG GAGCCGGCCT CTGGGCATGA AGGAGGGGAG CGTGAGGTCG 
CTTCTCAGCA TACGAAGCAG CCATCCCACT CGGTTTCCAA CTCAGCTCCC AATCAGTTTC 
GGAACCCTGA GGGGGAACTC CCCTTTACGC TCCCTGACCT ATCCGAGTCA GAAATTGTGG 
TTCCGGAGGA ACAGAAAGGA CGTGCGCATC CCCAGGTGAT ACCCGAGGGT GCGCCACGTG 
GACTGCAACC TGGTGAATAc TACGTACAGA TTGCAGTCTT TCATGACGCT ATCCAGGTGC 
AGAGCATTGT CCACCGTTAC GGGGTAGAAT ACCCCATCGC AGTGGAGCAG GACATCCATG 
AAGGTAAGGT GCGTTTCACC GTATGCGTCG GTCCTGTCCA AAAAGACGAA CGCGGCGCG t 
ACTAGAGAAC TTCCAAAGGT TTGGATTCAA GGACGCCTTT CTGAAAAAGG CGCGATGATC 
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AGGTCGGCCC TCCTCTTCCC CTCGTGACCG TGGTGACTCG CCCCGAAgGc GnCAcAGAGC 
CCGAAGaACG GAAGGGAAGG GGCAGACTTA ACTATTTCTT TGTTTTTTTG AGCACGTAAA 
ACGGCGCCAT CTCCTTTGAA GGCTTTCCTG CGCCGGGAGC GCCCATGTAG CGAACGGAgT 
TACTGTCTAT CAGCTCGTAC AGCTCTTTCT CGTGCGGTGC CTTCGATTGC TCCGAGGACA 
CAAGCGAGAG TTCGACAATT CCGTCTTCAC G T AC CATC C A CGTACCGCGA TACGTAAGAG 
GAGAAGGTGC CGACTTCTTC TCAAGGGCAA GCTCTACCTT TTGCGCAtGC CATCCGCGTT 
GAACGTCACA GTCGTATCGA TTCCCGGGCA ATCGGCCGCA GTAGCGTACC CCGAAAGATA 
CCTCCCTTCA ACGCGCACTC TAsCTTTTCC GCTTtGGCCT TCCCGGCGTG CGGACACAgG 
TTGTGCACGA GACACACAAA GCGCTAmCGA GCGCTCCAAC ACCAAGGAAC GCGCACAGgs 
CGGsACAGAT CtTTCATCAC AGAAAACCCC CTTGTCACGT CTGTAAGnTC AGGGGAGAAA 
AGCCCAACGA TGCAAAAGTT ACGCTCCCTC TTGCCAAAAG TGAGAGAAGA GCGCAGGCCA 
CgCGCACGgr GGCAAACGTG GGGTTTAGGA GCGCACCTGC GCCCCGGCCG CGGGTGCACG 
CATCTGAGCC TCAAAGGCAG CAACAAAGCG GGGTAACAGT TCATCAACGG TGCCCTCGAT 
ACGAAGGC C A GCAGCGCAGC GATGACCTCC ACCCCCGAAG CGAGCGGCGA TAACGCTTAC 
GTCAATCGAT CCTCGCGATC GAAACCCCAC CGAACAATGT GTGGGAGATT CCTGACGTAC 
CACCACAATT GCCTCGACGC CCTGGATGCT CTGGATGAGC TGATACAGAG CATCGGAGTC 
TCGCACATCA AGCCCCAACT GGACCGCGTC TTCGCAAGTT TCGTACGAGG TCATGAGCGC 
GCCACCATAA TACGGCGTCA GTCGAGAAAG CACGCGCGCA ATCAACATGC GCGAAGCAAG 
GGATCTCCCA CCGTTCATGG CAAGAAACGT GTCTTCGGAT TGGCGCCTGC ACGTACAAGA 
CGCGCAGCAG AAGCGAAcGT GTCCGCACTG TGCTCGTCGA GGTGGCGAAA AAAACcTGTG 
TCCGTAGCCA ATCCCAAAAA AAGTGCGCGC GCCTCGGCTG CTTCAAGAGA TCCGGCCATC 
GTCTCGATTA ACGTTTGCAC CAATGTAGTG GTGGACGGAG CTGTTTTTAC GACGAACGAG 
TGCGCGCAGT GGTCGCCGCA CGTTTCGTGA TGGTCTATGA ACGCGCGCGC AAAGGGGGCA 
AGCTGCGAGG C GAG CTCAGC GCCAACGCGG CTGAGCTCAG AGCAGTCGAC CACGATGAcG 
gCCGTCTGAT CAGACGGCcG TnAtCTGCGC AGAGAGACTT GGACGGAAGA GTGTCGCGTA 
CGCAGCGATC TCTCTACGCT TGAAAGGACC CGCAGACAAA AGCTCAACCT CTTTCCCTAT 
GCGTCTTAGG AACGAGGCAA GCGCAAGACT GGAACCTACA CAGTCCCCAT CCGGCTTCTC 
ATGCCCCACG ACCGCAAACG CGCGATGCGT GcaATGAACT CGATGAGCCC GGAAAGACCC 
CTCCCCCGAT CCCCTGTACG CGCCGCGCAC GGAGAGGGGG AGGAAACGGA CTTAGGAAGA 



2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 



Printed from Mimosa 02/03/22 07:28:50 Page: 697 



WO 98/59034 




PCT; 




696 



GAACTCGGGG GACTGGAACT GAACATCAAA CTTATCGTCG TACTCAGAAG GACTTTTCAG 
CGTTCCCCAC ACCGGGCTGC TCAGGCTGCG AGGGAGTGTG AGGTTTACGT AGGTAAGTGT 
GTTCCCTTCC ATAAAAAACG AAAGGTAGGG ATCGACGCGC ACTGGAGcGT GCCGCAAGAG 
CGCCTTACCC ACCCTGGGCG AGAACCAGGC GGACGGCAGA TAGATCTCGG CCATTTCGAA 
GTTAGCACGC CGGTAAATAA CCCGATAGCC CAAGCGGTGA GAGAAGACTT TCACCACAGG 
AAGATTGACG TAgTACATAC GAGAAGAGTT GCTCTCTGTG AGCCCTGGGG GAGGGGGCTC 
TAACACACGC CCAGCAGTAT CAGATGCATC CTGCGCACTC AGAGCGACAG CAGAAGCGCA 
CAGCACAAGG CGTAGACAAC GAGTTGAAAA TTTCATAGCA AAACCTCCGC GGCGGCAGGA 
TTCTAGCCTA GGTGTGTCCC TTAATACAAG CGTCTGTATC ACGCTCCTTG CAGGTAACTG 
CGCCGTGAAA AATGCGCGCG GTTGATTTCT TGCG TAG AGC TACAGTATGC TGGTACATGC 
GCATTTATCT TGCGTCAAAC AACGCGCACA AGCACGCGGA GTTCTCCTCA CTTTTCCCCA 
TGCACAC GAT TCTCCTGCCG AAAGACGAGG GTATCGATTT TTTCTCGCCT GAGGACGGGT 
CTACTTTCTT TGCTAATGCA AGGCAGAAGG CTGACGCCCT CTATGACGTG GTACATGCGC 
CTGTGCTCGC CGATGACTCA GGTCTCTGTG TGGATGCTTT AGACGGGGAC CCAGGGgTGC 
ATTCGGCGCG TTTTGGTGCA CAGCATGGGG TACACACAGA CAtGCGCGCA TGCAGCTCCT 
TCTGGAACGT ATGCACGGAC GGCArGACCG TGCCTGTTCC TTTGTGTGTG TGGCGGTACT 
GAAGTTGGGA TCGGTGCCGT TGTGCGTTGG GCGGGGGGTG TGCCgGGGAG TGTTGACTAC 
AGAAATGTCT GGGGTAGAAG GTTTTGGCTA TGACCCGATT TTCCTGTTGC CACACCTGGG 
CAGGACGTTC GCTCAGCTCA GCATTGAGGA GAAGAACCGC GTCTCTCACC GGGCACTTGC 
GGCGctGCGC CTCGCACAGG TGTTGGCCAT GATGCAGCTA CCCCGTkgsT GCGCTACGAG 
TTAAAGCTTT TGCGTGGTGC TCGTCGTATG AaCGCGCGGC GGCGTGCTGC GGCCTGGCGC 
CCCCTGTGCG CAACGTAAGG GACAGACCGC GCAGACTGCC CGAAGACACA AATTTTATGC 
ACGCGCTCGG AGGTGTGCCC GTCGCATACA TCGTGCCTAG TGCTCTGCAA GGTGCGTGAg 
CGAACGGAGT GCaGGACACG CGCTTGACTG CAGCTGAGAG GAGGGATTGT ACGAAGACGA 
TGTTTTTTGT ACCATGGCCC CACGCTGCAC CATCGGAGGG TCCCCATGGC GGTAAACGAC 
GAACAGTTTC AACTCGTTAC CTTCCAGCTC GGGGAGGAGC TTTATGGCAT CGACATTATG 
GGTGTCAAGG AGATTGTGAA GGTTCAGGAC GTTCGTCCTA TTCCCTGTGC GCCTGCTTAC 
GTGGAGGGCA TTTTTAACCT GCGCAg cGAG ATTATCCCTA TTATTAACCT GCACAAGCGC 
TTTCACCTAC GCGAGGCTAC GCTCGAGTCG GGCGACGAGT ATCTCGGCGG CTTTGTCATT 
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CTCAATGTGG AGGACAGTAA GCTCGGCATT ATCATCGACC GCATCGCGCG TGTTATCGCT 
GTCTCGCAGG AGGACGTGCA GTCCCCTCCC CAGGTTATCA CCGGcATCGG GGCGGAGTAC 
ATTCATGGGG TCGTGCGCCA GGGGACGAGT TATCTTATTG TTCTGGATAT CCACAAGCTG 
TTTAGCTCCA AAGAGTTGCA AAAGCTCGCG AACCTCTAGT GCCCCACCGC TGTCTGTCTC 
CTGCCTGCAG CTCCAAGCGC CGTGCGGGGT CCATCCCCGT CCACAGCCCC GCCATCCGAA 
GgTCCGAAAT GGACGCGGTG CTCACCTGCT TGGTGGATGA AAAAATTGGC CCTGGTTCGC 
TTGGCAGCAC CCTCATCCAG TTGGTGCGCG AGGTGTTTTC TCCAATCGAT GCATAcGTgc 
TGCGCAGCCC ssTATCGCAC TTTCCTTTgC ACTCCGTGCA CTGAAATTGC CTCCTGCTTC 
CCCTGTACTT CTTTCTGCGC TTGCGCCCTT CTGGCACTAC CGTGAGGTGC TTCACCAGGG 
GCTGCAGCCG CTTGTCCTTG ACGTAGACAT TCACAGCGGT TTGTTGTCCC GTGATGTGGT 
GGAAACTGGC AtCGCGCGTG GCGCTCGTGC GCTTCTTGTG CCTGAAACAC TTGGAAATGT 
GCCTCCTGCA GCGGTGTTTT TGGAACTGGG GATACCCGTC ATCGAAGACA GCTCTCAGAG 
TGTCGGTGCA GTATtGGGAG AAAAGAAGGT GGGAACcTTT GGCTCGTGTG TCATCGTGGG 
ATTGGAGGCA CACGATATGC TTACCGCAGG CGGCGGCGCG GTACTCATGG CCTTTGAGGC 
CGCCTGCGCG CGTCGGCTTC AGGCGCTTGT GCCAGAAGCG CTTGCCGTTG ATATGCTGCC 
GGATATGAAC GCGGCGCTCG CGTGTGTCCA AGTAAAGCAG CAAGAAAAAA ATATTGCCCT 
CAGGCGCGCA ATCTACGACC GATACTCCTC TGCGCTTTTG CGTACGCGTC ACGGTACGCT 
TCACCGGTGT GAGCAATTGG AACACAGTGC CTACGCTTTT CCTGTTGTCC TTGCTTCTGA 
TCTGAAGGAA GTGACGCGTT ACGTGCGGCA GGCGTCCATT GAGATTTCTC CTGCCTTTGA 
ACATTCCATT GTGGCAGCGT TTCAATTACC TGCTATGCGC AGACGGTGGC CTTTTCCGCA 
GTTTCTTCCT ACTTCTGCAT CGCACACGGC ACCTTTTCAG GGTGAGGACA GGGAGGTACT 
AGAGACCACG CAGGGCGCGG AAAAAACCTG TCAGGACTCT AGCTGGGAAA GGGAAGTGCG 
TGCGTCTGAG ATTACGCCTG AGATGTGTTG GCCACATGCA TCTGCGCTTT TGTTGCGCTG 
CGTGCGCTTT CCGTTGTACC CGCGTCTTGC GCCTGCACAC GCACAGGAAA TTGCGCGCAT 
CCTTGGGACA CTGCCGTGAG CAGCCGCGTG TGTCCTCAGC GGCCTGTTGC AAAATCATCG 
GGGGACGCGA AgGTGTTGCT TATTGTCAGC ACGTACAAAC CGCGCGCTGC GtGCTCGCTG 
CGGACGTTGT GAACTTTCTG AGCATACGTG GATTCCAGTG CCACACCATT GAGTATGATG 
GATTGAATAA AGAAAGCTGT GCTCGCGCAG GCTATATGTT TGCAGTCAGT ATTGGGGGGG 
ATGGTACTAC ACTGTTTGCC GCGCGCTGTG CTTCTCCTTC TGGTATTCCC ATACTTGCCA 



6000 

6060 

6120 

6180 

6240 

6300 

6360 

6420 

6480 

6540 

6600 

6660 

6720 

6780 

6840 

6900 

6960 

7020 

7080 

7140 

7200 

7260 

7320 

7380 

7440 

7500 

7560 

7620 

7680 



Printed from Mimosa 02/03/22 07:28:51 Page: 699 



WO 98/59034 




PCT/ 




698 



TAAATTTAGG GCGTTTCGGC TTTATCGCTC CTATTGAGCC ACGGTATTGG CAACAGGCGT 
TGAGCGATTA TTTGGCAGGG GGGGTGCGCC CTGCtGAGCG TGCGCTCATA TCGTGCACCG 
TCACGCGTGC GGGTAAAGAG ATTGCTTCGT GTCTGGCGTT AAACGATGTT GTCCTTTCAA 
GTGGACCGTC GCGCGTCTTA CCCGGGCAGA GGTGTGCTTC AACGACATTT CTTTTGGCGT 
GTATGAAGCT GATGGCATTA TTCTTGCGAC GCCTACAGGA TCTATGCGTA CTCGGCGGCC 
TGTGGCGGTC CCATCCTCGA TCCGGACCTT GATGCGTTTG TCCTCACTCC CATAAGCGCA 
CTGTGCCTTT CTAATCGTCC CGTGGTAGTT CCCTCCTCAG GGGTGGTGCG TATCAAGGTG 
TTGTCTATGC GACACAAAGA AACGGTGCTG TCTGTGGACG GACATGAATT GTGCACGTTG 
CAGGAAGAAG ATCAGCTGCT TGCAAGCAGG TCaTCGTGCA GCGCACGATt GGTTTTCTGT 
ACACCACACG TGTTCTACCA TGCACTGTGC TCGAAACTGG CGTGGTCAGG GAGTATTTTT 
TCTCGCAGGG GAAGACGTCA CGATGATTGA GCAACTTTCG GTGCGCAACG TTGCGCTCAT 
TCAATCTTTG GCGTTGGAGT TTGGTGCACA GTTTACTGCC CTCTCAGGGG AGACGGGTGC 
GGGTAAGTCA ATGATACTCG GCGCGCTGTC CTTTCTCTGT GGGCAAAAGG TAGGGCCTGA 
TCTTATTCGC AAGGATGAGA ACGAGGCATG GGTTTCTGCG GTGTTTCGCT GTGAtCACgc 
ACCGCGTGCG GTGCACACAT GGTTGGCAGA ACGGAGTATT GAGCCTGAGC ACCACCGCGT 
GCTCCTTCGT CGGGTGATGC GGCGTACCGG TCGTGGCACG GCGTGGATTC AAAACGTCCC 
GGTCTCTCGC GCAGATTTGG AGTTTTTCAC GTCATTTTTC ATAGACCTCC ACGGACAGCA 
TGAACACCAA TCGCTGTTTC GTGTTGCAGA GCATCGCCGC TTTCTGGATA CtACGGAGGA 
CTCCAGCAAG AAGTTGATGC GTTTACTGCG TGTTATGCGG CTCTTGCAGA GCGACGCGCG 
CAgcTGCAGC GGcTCGCTTC CTGTGAACAC AACCGGCAGG AGCGGCTAGA ATTCCTCTCC 
TTTGCCCTTG AGGAACTGGA GCACGCAGCG TTGGACGTGC ATGAGGAGCG TGCGTTGGAA 
GGAGAAGAGC AAAAGCTCTG CCAGCACGAA AAACTCTGTG ATGTGATGCA AAGGGTTGAC 
GCTGCAATTA GGGGGGTGGA CCTGCAAGAG GGCGCGCTGC TTTCTTCCTT AAAGAAAGCG 
CTTGGTGCAC TTGAAAGCGC CTGTGGGATT GATGGGAGTC TTGAGCCGGC GCGTGCCCGT 
TTAGAAAGTG CGTACTATGA AATCGAAGAC GTAGCGCATG TTCTGCGTAC GTATACAGAC 
GGTATTCAGT TTTGTCCCGA CCGTTTGCAG CACGTTCAGG AGCGTCTTGC GCTCATATAC 
CGGCTGAAGA AAAAATATGG AGGAACAGTT GCGCAGTTTT AGAATACCGT GCGCGTGCGC 
ACAAGAGATG CAGGATCTTT CACAGGCGGT GGGTGATAAA GAGGCGCTTG AGCAAGATGT 
TCAGCGTCTG ATGGCTCAGT ATTACACGCG GGACGTGCCT TATCGCTTAA 
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(2) INFORMATION FOR SEQ ID NO: 105: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3245 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105: 

CTCTTAGAAA CCGGTnCACC GATcCACACT CCCTCTGAAC CGAGATATTC GGTACGCCGT 60 

CCTTCAATCA AAAACTGCAC CGCATGCACC GTAGGAAACG CAGCTGCCGT GAACACCACC 120 

TGCGAAAgtG CGCCAAGTAG CCCTCAATGC CATACTGATT GAACTGAAAC TCCTCGCTCA 180 

GATCCAACAC CGCCACCCCT TCTTGCACGC GTGCAGAGCG CAGTCGCGTA CCTTCAGGGA 240 

TGAGCGTACG CAgTCCGCGC ACCGCTTCTG CCTCAGACGG ACCAAGGAAC AATGCGCGCA 300 

ACGCATCCGA AAGCGGCACC GTCGAATGAG GAAGCGTGCG AGCTACCTCC TCACGAGTGA 360 

TCTTTC CATC TGCATCAATG CGCACCCAGC ACAGCGTAAT TGCGCGGGAC TGTTGTGCAG 420 

AACCGGGCGC CCCAGAAACc GTGCCGACAT CTTTCCTTCG GTACCAGCGT cAsTGGCGCA 480 

GGGGATCCTG CAACAGGAGC GTCCGGCTGT GCCACATCCT GCGTAGGGAG CGTGGCTTTA 540 

GCATCACTTT CAACAGCCGA GCCGGCCTGA GAACGACGTG CAGAGTCAAA AACGCGCATG 600 

GAATCCTGGT TGCCGAACAT CCCTTGCACT GCAGCGCTCG AAAACACACG GTCAATATTT 660 

CTTCTATTGA GAAAGAAAAG CACCGCCATT AACAAGGAAA ACGCAAGCCA GAACAGCAAC 720 

CCCGCTGATG AGTGTTTCCC CTTCCCGGCC ATACAGCGCC ATCGTACAGG AATTATAAAG 7 80 

CTTTTCCAAG ACATGCTGCA CACACGTCCC TTATCCCCGC CTTGCAAAAA ATATAGGCAA 840 

AAAGAAAATA AAGCAGCCGA TATTGGGTCG TGCGCATGGC ATTCATGTTA CTGGCCCTGC 900 

TTTTCTCTTT TCGTAATGCG TCCATTCAGG CAGAGGACGC GCGCCTGCTG CAGCCAAAAA 960 

CCAACGCTTT GGATCTTGTC GTGCAGGGGG TAG ATC TTGT GCTGTTTGCC CAGGATAAGA 1020 

CGGCTATCAG TATCAGTACC CCTCCTGAAA AAGACGTGTT CTTCACAGAA CACGAAGGGG 1080 

TGCTTCGTGT CCGTACACGC ACAGAAAACG CGGAGGGTAC ACGCCGAGTG ATACGCATTG 1140 

GCATACCGCG TGCACAAACG CTCGCATGGG TGAAGATCAT TGCGACGGGC GCACATACTA 1200 

CGGTGCGCGG CGTACGCgCG GtGTGGTCAC TGCTTTTGTG CAACGAAGGC ACACTCGCCC 12 60 

TCACGGAAAG CACGCTCAAG TCATGCACGC TGACACACAC GCGCGGCGAA CTCCGCTTTG 1320 

AAGCGGCGGT ACTAAAACGG GCGTCATTCT GTTTGAATGA CGTGAACGCT CGTTTCACTC 1380 



Printed from Mimosa 02/03/22 07:28:53 Page: 701 



WO 98/59034 



• 



PCTi 




700 



TTCTCGGATC GCGCGCCGAC TACCGTCTTA TCTGTAGCCC AGGAGAACGT GCGTGGAAAA 
TTGAAGGCGC CGAACAACGA GGCGCGCACT ACACCGAGCC CGCACGGGCG AgaCGCCACA 
TGGTTATCAG CGCGAGTGCT TCGTCGATAG ATGTAATGTT CAAAGCGCCA CCTACACAAC 
AGGAAGCGGT AGACACGACA CAGAAGGGGT AATCCAGGAT AGACTGCCCC TTTCAATATC 
ACCTCAGATA GCAGATTCAC ACCGCGCACT ACTCAAGCAC GTCAGTGACG ATGCGCACCA 
CGCGTTTTTT CCCCGCGCGA ACTATTACCG TCCCGTCCAA ATCCAACGCT GACTGGTCAA 
TTACCGCACC GATATCTGCC ACGCGCTGGA GCCCGACAAA AGCTCCTCCT TGTGCAATCA 
AGCGC CGTGC ATCACTCTTC GTAGTACACA ATCCAACCTG TACAAACAAA TCAGTCACTT 
TGATCCCAAC TTGCAACGTG CACTGTGTCA GCTCGAACGT CGGCAATGCA CACTTATCGC 
CACACCCGCC GAATGCCGCG CGCGCTCCCT GCAACGCCAC CTGCGCGACA gcCGTTCCGT 
GCATGAGGCG CGTTACCTCG TATGCCAACA GCTCCTTTGC ACAATTAATT CCCTGAGTCA 
ATATCGCCTC GACATCGCGC ACAGACAAAA AGGTAAACAG GAGCAAGAAA CGCCGCACAT 
CTTCATCCGG AGTATTTCGC CAGTATTGGA AAAAGTCATA GGGAGACACC AAAGCCGGGT 
CTAAGAAGAG CGCACCTTGC TCGGTCTTGC CCATTTTTTG CCCATCCGCC CGGGTAATGA 
GCGGAAAGGT CAACCCATGC ACGGTTTTTC CGCGCACTCT TCGAACCAAA TCCGCCCCGG 
CAACAATGTT GCCCCATTGA TCATCGCCGC CAATTTGTAA CTCTACCGCG TAGcaTnCAC 
TGAGCGTTAA AAAATCATAG CTCTGCAATA GCTGATAATT AAATTCAAGA AAGGAAAGTC 
CTGTCTCCAG GCGTTTCTTG TACGCCTCAT AGGTAAGCAT TTTGTTTACA GAAAAATGCG 
CCCCAACCTC TCGCAAGAAA TCAATGTAAT TCAAATGTGC CAACCAATCA CGATTATTCA 
CATAGAACAC ATGC CTGTG A TCGAAGGAAA GAAAATGATc CAGcTGCGCA ACTATCGCTC 
CCGCGTACGC ATCGAGCGTT GCATAATCGA GCATCTTGCG CATACTGGTT TTGCCGGAGG 
GATCCCCAAT ACGCGCGGTA CCTCCACCGA TGAGCACGCA ACCGCGGTGC CCCGCATCAC 
ACAAATGCTT TAGCGCAAAC ATAGGGAGCA TGTGCCCAAC GTGCAAACTA CTGCCAGTTG 
GATCTACACC GACATAAAAG GTGAGTGGGC CTGCATCCAT ACGCGCCGAA AGCGCCGAAA 
GATCAGTACA TTGTCTAATA AAACCACGCG CCTGAAGACG CGCAAGCgCA GGaTTCATGG 
AGCCGATTAT ACCGCGCATC GCACACCCCG ATCCAGGGCA GGCGGTTCAG TCCCGAGAAA 
AAGACAAAAC GCGCTGATGC ACCCCCACCG TCGCCCCGCG TGTCACTATT CCTTTAAAAG 
CGCTGCAATC TTCGGACGAC TCCAGCGCAC CGCAACGTCA TAGGGTGTCT CTCCCGCCAC 
GTTTCGTAAA AACTTTCCAA AGCGATTCAT CGCCAGCAAA CtTGCAGTGT CTTTTCATCT 
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GCAACCTTTG CTGCGTAATG GAGAATACTT TCTCCAGCTG AATCTGTCTT ATTTACCGCA 



3180 



AAACCCACCA GCGTTTTCAA GATTGACGTG TtCTTGCTCA GGACTAACAA GGCCGGAGnA 
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(2) INFORMATION FOR SEQ ID NO: 106: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1347 base pairs 

(B ) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 106: 

ACAGGGAGGT TCAGGCATCA GAGAAGGACG TCTGTACACT CAGGGAACAG GTTGCCGTGG 60 

CGCAGGGTCT GATTGCTGAG GTTAATAAGG AGTCTTCTTT CGTCGACTCG CTGAGTAAAC 120 

gCGTTGCgGC TGCAAAGACG CAGCTGCAGC AAGTGTCCGC TGCGATTCCT GATATGCAAA 180 

ATGCATTTAC GCGC GAGAAT ACCGCGCTTC TCCACCGGGT GCGAGATGGA GTACTTGCAG 240 

ACGTACATAA GGAATTAGCG GTGTTGCAAA CAAGGCTGGA AAAAGCGCAG GGGGAAAGCC 300 

AGTCTCTTTT TGAAGTTTCT GCAGTTAAAT TGCGTGAGTT GTATGAAGGG GCATTTTCTG 3 60 

AAGCAACTGT GCGTGCACAG GTGCTGGAAG AAAATGGATT CGGTCAGTTG AAAGTACAGG 420 

CGGAAAATCG CCTTCTCCAG TTGCAGGAGG AGTTTGAAGG GAGCCTCCTT TCTTTGCAGC 480 

AGCACGTTAT GCAGCGTGTC GAACAAACGG ACCAGCACAT CCAGGATTGT GCATCCCAGT 540 

GGTCTGTTCG GGCGCAGACA TGTGAGTCTG ATTTGAGTAT ACGTCTTGCG GACGTTACGG 600 

CGTGTGTGGA TGAAAGCGTG GCGC AAC TG A AGGAACAGAT TACTACACAG GAGCGTGAAG 660 

TGCGTGCGCA CCTGGAAGGG ATC GAACAGT CGCTTTCAGG AGCAGAATCC GGTTTACcGA 720 

GCGCGTGCAC AAGAGTGTGA CAAGTTTTCA CGAAAACTTA AATAAGATTG CAGAGGCTTC 780 

TGATGCGCAG TTACAGCAGT ACAGGAAGGA GATGGATGGA CGCTGTAGCA AGTTTGACAG 840 

AGAGCTTGAG GGTATTGATG TCCTTGAGTC TCAGTTGCaG CTTGCGCGTG AGCGTACAGA 900 

ACAGAAGGTG CGCGAAGAAT TTGAAGCGTA TGCGCAGGAT CGTGAGCGGA AGCAGTTAGC 960 

GTTTGAGGCA CAGTTGCAGC ACAGTATGGA TACGGTTGAG CACCGTATGA AGCAGCTGAA 1020 

TGACGAGCTG CGTGAGCTGA AGGCAAGTGC GTATGCAAAT GCATCCGAGA AACTGCAGTC 1080 

GGTGGAGGAT AACTTTTTTG AGGTACTTAC CAAGCGCAGm aCTCGTTGCA CGCGCGCTTT 1140 

TCCGAGTGGA GTGAAGGGAT TGAGGGTCGT TTGACGCAAC TTGCTCTAGA GAGTGAGTCT 1200 
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GCGCGAAAGG ATCTTGAGGA TACGTACCGC AAAGAtTGCA CACGCGGCTC AAgGATTTTG 
TGGAArAATA CAAgGGGCAG TGTACAAAAC TGGGAGAGCA AATCCTCGCG ATTGAATCAA 



(2) INFORMATION FOR SEQ ID NO: 107: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5230 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 107: 
AACTGTTTGC GAAACGGnAT TCCAACTAAC CGACACATTC AACGTACGCG CAACTTTCAT 
CGCAGACACA AACGCTTCAT CAGCAGAAAG ACACAGTGCA AAAAAGAAAA CACCACCTAT 
CAAACACGAA CGCAAAAACT TCATAATGCC CCTCGCTCGA GATACCCTGT CAAAGTACCA 
ATGCACACCT GTCCCCTCTC CCTACTCACA ATAAGCACAT ACGTCCACGC ACCACACTAC 
CATACCCTTT GCGCATGCAA TCTCTCACCA GATGACAACC GGTGCACTTT ACTGAGAACG 
ACACGCAAAG TACGCAAAAG CTCACGCGGT AGATATTCCA ACcGCTCCCC TAACAATAGT 
CCTAACTCCT CAGTGCTATT CCATACATC a TGGAAAAGCG CGGTTTCTTC CGGACTCATC 
AATCCAATTT GGAACCCAAA AATCATCACT CCACGCCGAC GCAGTTCCTC TATCGTTCGT 
CGACTCTCAT GAGGGTATGa GGAGCATCCG TCAGTAATTA CCAACACAAC . CTTAACAAAA 
GATGTATCAG ACGACACACG TGCACGCCTG CGTTCCACGT CTGCAAGAAT GTGTTTTAAC 
ACCTCTGCAT CATTCGTCCC TCCGAATCGT CCATCTAAGT TCACCGATGC TTTGATAAgt 
GCGCGCTATT GAAGTCTTTC GATTTACTCT TTCCAAATGA CTTCACTTTT ATAAAACTGC 
TCCCAAAGTA ATACACCTCA GAATGAATAG TTGTTCTCTC ACGCCTACCT GCTGCGAGCA 
TGTCTGAGTA CTCTCCAAAA TCTTTCAACG AAAGCATACT TACCGCAAGC GCTTCCCTTG 
CACTCGCAAT CTTTTCTTTA TTCATAGAAC CCGAGTTATC AACAACGAAT GAAACTTCTA 
TCTGTGCCGG ACGTAGCGTA CACGCACGCA CTTCATACAC ACGATCATAC ACGCGCAnTT 
GtACCTCTGA TCCCTACCCC GAAATTCTTG AAAATCAGGG AAAACTGAAA CAAAATCGCC 
TACGTTTAAC CTCCCCTTCG GGCACTTCCC CTCGTAACAG TCTATCCACT GGACTGACTT 
TCCAATAAGT GATTTCCAAA ACGAGCGCAT ATCGATACGC AAACGATGCG TTGTACGCAG 
CAACCGCGCA AACTCTTTGC GTTCCTTCGG AGAAATAGAG AAGCGCTCGT CAAACTGAGT 
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GGTACAAGAC TCAAAAACAT CTTCTCTTTT TTTCAGCTCT TGCTGCCCGT CCCTTGAGTG 12 60 

CACAGCGTGC AAATTACTTT TAGTGTATTC GAAGGCACGC TCGTTTCTGA AAGGATCATA 1320 

ACTTCTGTCC GTTATATTCT TTAACTCGAA TGTGATATTT GCCACACTCA GATGCCAAAG 13 80 

CCGCACAAAA TGCTCTAAGA GAAACACAGA GACAATACGG TCCCGGTGCT TGGTTGGGAC 1440 

TACCTCGCCA GAAAGGGGGA AAAATTCTGA GTGAATAAAA TTACTCATCT CAGTACCAAG 1500 

CACTAAAGTG CGTAgcCTnC CTGTACTACA GGATCCCGTG CGATGCATGC GCGATTGCAA 1560 

CCCTTTGGCA TATCATGGAA CAGCACGCGC TCTGCAAGCG CATATGCAAA CTGCAAGTGC 1620 

AATGCCGTTT CCGTATGTTC CCAATACACA CCCCGTTCTT TTTGATACAG AGAAGCCAGC 1680 

AGCGCACTTC CTTCTGGAGT TGTGTAGTAC GGAGCTCTGT TCCGCACATA GTGGAAGGCA 1740 

TACACCGCAT CGAAAAGGTA AAACAGTGAT GCCACTTCTG TGTGGCAGTA ATGCTGAACA 1800 

AAGGCGCGAA CACGTATTTC ATCTCTCGCT TCGCGAGAra AAACTGCCTC ATACATAAGA 18 60 

CGTCGCATCC CCTGTGCAGC GTCTTCAAAA AAAGACACAC GGTTGCAGTA ATACGCGTAA 1920 

TCCGCCTTTA ATTCTCGATA GTAAACAAGC TCCCGGAAAA GgTTCCACAA TATTTCTTTt 1980 

CCGTCAAAGT CGTCCTTTAA AAACCACTGC AGTGGAACCC CTACCATCCC CTCCTGCGGA 2040 

ACAAACACAA ACCGATCAGC TTGTAAAATA GGTACACAAC GAAAATGCGC ATCCCCCGCA 2100 

AArGTGGCGA TATTCCGTGC TTCAGCGGAA AAAAAACGCG CAAAATTCTC TTGCGCACGc 2160 

tTcGCATCAC ATTCATTCAT TGTCATGCAA CGGCGGCGCG ACGATCTTAC TCAGTCTCTC 2220 

TTTAAACATA GAAGGTTCAC AAAATTCGTG CACCGTCAAA AGCCTGCACG TACCATTTTC 2280 

TGCCGCAATA AACACGTTGA ATACCGACGG GTGTTCACTC TGCAATAAGA ATGCAGAAAC 2340 

GACCGCTCCT GTTGTGTTTC CAAATGAGCG AGAATACTGC GCCGTGCCCG. TAGCATCGAA 2400 

TGACAGCAAA CGCGCCTCCC CACCCCGGCC GAAAACAAGC ACCATGCGTG AAGAGAGCGC 24 60 

CAACGCGCAT AACACTACAC TTTTGAAACC TGTAATCCAT TCGCCATACG CATAGTCCCC 2520 

CGCAGCCATG CCATGCTCGT GCCGAAGCAc GCGACTTTCC CCGTCATCGC CAAAGACTAC 2580 

CACTCGATCC TCAGAAAGCA CGACCGCTTC AGTGTATGGA TTTTCAAATC CATATATCGC 2640 

GTCAGAATAC TCTTCATTGT GTAAAAGGCG TGTACCGCTA AAACCACTAA ACACCACAAG 2700 

CTGCGCAGTA TTAAGAGCCC GTACTACCCG TATTCCCCCA CCTAAACCAA GAATCGGCCC 27 60 

ACGATACTCA TATCCGTGTG CACTCTTTTC TAAAATGAAG AGAGCACCTG TCTCATCACC 2820 

TACCACCACT CGGTGAGCAT TTACTTTTTC CACACACAGC ACACGCTCAT ACACGCCTTC 2880 

AACACGCGCA ACTACCTCAA ACGAAACCCT TCTTCTCTCT GCCTTCTTCG ATCGTTCCTG 2940 
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AATAGAAAGC ACGATCGTTT CaCCACtTCG GTAACGACAA CTACTTCTCC ATCACCCATA 3000 

GAGATAAGCG CACAAATACG CCAATCCGTT TTCACGCTCT TATGCGTGGA TTCCGCAACA 3060 

AGCCATTGAT CGTGCACCCG GTATAAAAGC CGTATATCAT CTGTGCCACC TGCGATAAGG 3120 

ACAGAATCTT CCGCCAAAGG AATCGCTGCG GTGATGCTTT TTCCAAATTC AAAAAACACG 3180 

TCACTGTACG TGCACTCATC TCCACAGATA TCGGGAAAGC AGTGCCGAAA CTCGGGAAGC 3240 

AGCTCCTGAA TCAATGCGTC ACTTGTCTGT TCTGCACGTG CAAGATCATC CCACAGTGCG 33 00 

CGCGGAGTTC CTCCCTCACT GCGCAATCGC GCGTGCTCAG ACACCAGTTG GTCACACGAA 3360 

CACTGCAGAT GGGCAATCCG ATCCGCACAC TCACGTGCGT ATGATTTCTC CTGCAAAAGT 3420 

CTGAATGCTG CACACACGCG TACCAACTCG TGTATCCGAT CGTGCAGAAC CAACATCGCA 3480 

CCGGCATCTA CCACATCCGG CACTGTATTC TGTGTATGCA TGGCGTACGT AGGTCCGTGT 3540 

CCAAAGAGCA ATACCACCAC CTCAGCGAGA GTTTTTACCC CATCCGGCAG ACGCGTGTAT 3 600 

TCGTAAGGAC GGGTACGGAT CTCTyCGTAA GAgTAAAACt GCCCCawTCC CtTGGGCCGC 3 660 

ACAcTCCACC cGTCyTTTTC TTGAAAAAAA CCAAaTCGGA CTGCTTGCGA TAAAATATAA 3720 

TTTTGATCGT CTGCATTCAA AACACCACCG ATAAAGCCGT CCCACAGCGC CTGATCCACA 3780 

TCCTTCTCTT CGCCCCGATT CCACTCATCA AGCACACGGA TCACATTACG AATGGAAAGC 3840 

ACCGCTTCAC GCAGCTCAGG CTCCACATCC CCAAGACCGT CATGCGTCTC ACTTCCCCCC 3900 

GTGCTATCTT TCCATTTGCC GGAAAATACC TGCTGCGTCA CCTTTGCCAA TTGCGCAAGT 3960 

TTAAACAGCT TGTCCAATGA ACCATCAATA TCCGGCAGGC AAAGTGATCC TCCAGAAGAG 4020 

GCAAGACGCG CGAGAATGAT GCGAAAAAGT TCATTTTTCT CAGGATGCGC CTGATTCGTA 4080 

TAAAGACCTT CTGTGGACTG CGGAGGGTAA TCATATTCGA ACGTGTTAAA GCGCGAAGCA 4140 

AACGCGGGGT TGAGCGCCCC AGTGCCCTCG TAATGAACAA GTCCACTGCT GATGTTCCCC 4200 

GTGGCAATTA CGCCAAACCC TGCAGCTATT TTTACCGGGC CTACCCCGGG GATATAGGCG 42 60 

AAATCCCCTA CCCGTTTTTG CAAAATATCA TTGAGTGCGA TGAGATGCTG CATGGGAATT 4320 

GCGTTGATTT CATCGATAAC AAGAGGTCGT CCCTCTTTTA CTGCGCGCAg CACCTCACGC 4380 

TCAATCTTTT GTAC CTCAGT GCCAAAATTG CCATACTTTG CAAGATAGAT ATCCAGCATT 4440 

GGATCAAATG CGTAACCACG CTCAGTAAAG GCATGCACAT CCTGGTAACA GTGTTCGGGT 4500 

GTTTTTCCCT CCAGACTGTT TTTTAACACA AGCGTCTTTT CAAGAAACAG ATCCTCTGTG 4560 

TCGATGTGCT TTGACCCCGA AATAAATAGG GGTTTGATGC GCTGCAAAAG CCGCTCGTCA 4620 

CCGAGACGTA GCGCCTGTTC ATAATATGCC GCGCGCTGAG AAAAAAGGCG TTGAAACTGC 4680 
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GTACGCGCTT CCTGGGAGTT TTGTAAGGCA TCTGACACGC CACGCTCGCT GCCCCGAATC 
TCCGCACACG AGCCACCGGT TGCAGCAGTG CACCACTcAT TAAGCGCACA ACGGACATCT 
TCAGTGATTA ATTTATGAAG CGCAAAACgc tCCGCGGCGA GCACGGCGAG CTCAGTCTTT 
CCCGTGCCGA GATGCCCGCG GAGTAGCACC GCATCCCCGC GGGCAAGGCT TGTCGCTATC 
CTCTCAAGTG CAGCACTAAC AGAGGGGGTT TGCGCAAACA CCCCCGAGtG ACGCGTTCAA 
CTTCCCCAAG CCATTGAGCA GCATGGAGAA CGAAAAAGGA TTCAGGTGTT TCCAACGCGA 
GCGCTGCTCG TTGCTCCTCC AGATCCCGTA TGGCATCTGA GTGCACTCTG GTAAAAACCG 
TACTCCGCCT ATGGATACCG AAGTGGGCAC AGGCGACTGC ACACTCCCAG TtCcGCAAAA 
AGCGCTCATA CACAATACGC GCGTCAATAT CTGCCAAACG TGCCCGATAn CCGTGCAAGT 
TCTATCCGCG 

(2) INFORMATION FOR SEQ ID NO: 108: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 1379 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(D> TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 108: 
TTGGATTCGC CAGTCTATCC TCAAGGCACT GAACGAAAAA GCACGCATCA TCCGGTTGCC 
CGTGAGTCGT ATTCGTGCTG CAAAGAGCAC GTATGAGTAC ATACCTCCTG TTACCTCGGT 
GAATGTTGGT ATATCGAAAA CACaGGGAGG CaACAGATAT GGCaAAAGGC GTGCAACGAT 
GTGCCGAGCT GTGCCaACAg TGGAAGAAGT TCACATCTGG CACACGCCGC TATCTCTAGA 
TGCTCCGCTT GGAGCCTCGG GAAGTCGTGG TGAGTCTGGG GATGTTGATG GACTGCGTAT 
AGGTGACTGC GTGCCGGACG ATCTTTACGC GCAgcCTGAG GAGCATATGC TTGCATGTGC 
ATTGCAAGCT GATATTGCAA AGATTTTACG ACTTTTGCCT GCGCGTGATG CGCAGGTCAT 
CCGCTATCGA TTTGGACTTG GCGGGTATGA ACGGCGTTCT CTGCAAGAGA TTGGAGAAAT 
TTTTCAGATA ACAAAGGAGC GCGTTCGCCA AATAGAAAAA AAGGCTTTGT TGCGTATCCG 
TAGCTGTGCC CGTCAACACA GACTGGATTC CTACATAGCG TAGACATCAG AGGCGTATGA 
AATAGGAGCA GGACGCGTTG GGCATACAAC TTATAGTGTT TTTAGGAAAT CCTGGTGCAG 
AGTACGAAGA AACGCGGCAC AATGCTGCAT GGTTGCTTTT AACGTACCTT TTCCCATCCA 
TCGTGCTTCC TTGGCGATGC GGATGTCGGG GGTCGATTGC GCGTATTGAA GGGTTTGAAG 
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GGTCAAGCGA AGAAGTTTGG CTTTTGAAAC CGCTGACTTA TATGAACCGT TCTGGGAAAA 
GCGTAGGGGC AGCATGTGCC TTTTTGCAGA CGGATGCGAA cAGCTcTTAG TAGTGCACGA 
TGAATTAGAA TTACCGTTCG GTGTGGTGAG TTTAAAACAA GGCGGAGGGC TTGGAGGACA 
CAATGGGTTG CGCTCTATCA AGGAAGTGCT TGGTACCGCA GATTTTTGGC GGTTGCGCAT 
AGGCATCGGG CGTCCACCCA GTGAGAGTGT GAATATAGCG CAgTACGTCC TCTCTGCCTT 
TTACCCGGCA GAGATGGCTG CATTCCCAAA GCTGGGGCGT GCCACGCGAG ATCTTCTGTG 
TCAGCTTGTA GTAACAGATC AGGCAGCGAC AGTCACCTTA CTCAGTGCGT GGAGAAAAAA 
ACGGTTGCTG TCTTTATGCG AATAAGGACA GGGTGACTCc CATACGGTGA AGGAAGGGTA 
AAAAGAGAAA GTGTGGGGAG GACTTGCATA AAGAAAGAAT GGTGGGTTAn TGGCGCGCCA 
CGTGTATTAG CTAGGACGAG GAAATCATAT GGCATTTGAA ACAATTTCGT CATGCTTAA 
(2) INFORMATION FOR SEQ ID NO: 109: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1531 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109: 
CTGCCCnAGG CGCGTGTGGG TGGTGCGCAT AACCAGGGGA AGGATATCCC ACCGTGTTGG 
TAGTAGAAAC CCGTGCTTGA GATTGTAGGG ACCGGTGCCC ATAACTGCAC CAGGGGCCTG 
GCCATGACTG CATACCCCCT TGCCATAGCT GATGCCCCAC TCAAGTGTGG CAGAGCCAGT 
TAGCTTCGGG GAAAACTCCT GTCCGAGCAC TCCCCCGCTC GCTCCTACCC CCACCACCAC 
ACACAGCACA CTCCCCCACC GCATGCACCC CATGCTACCT CACCCCCCCC CCGGCCCTGT 
CTAGTAGCCC CCTCACCCTG CCACCTGCAC ACACGCAAAA ACTCACCACT CCTTGCACCT 
GCCTACCCGC CnGCATCGcG CGcACCCCAG GCGCAGACCT TTGCGAGCGC AAACGCACCG 
ACACACCCAG CCACACATCC CATAAAAAGC GTAAACTGAT CCTGCATCCC GGCTGCAGTT 
CCCCGTGGGA ACAACGTCAC GCTCGATCCC AGCACGAGCC CACACACACA gCATGCGCGG 
aCACCGGGTA CAACCGAAAC CATGCGCGGA GCACCCGCGC CGCCCCCAGC AACCCTATCC 
CCATTCCCAC ACAGAGTGGA AACAGGAGCC CCCACACGTG CGCAGAACAC AACCTGCTTA 
TTACCGACGC GCCAGTCACT CCTCCCCCCT GCTCCACCGC GCGCAAAGAC AACACGCTCA 
AAAGCGGTTG GTACACCCCC GCCAGCAGCA ATACCAATGA ACCGGAAAAT CCTGGCGTGA 
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gCATTGCCGC CGCCGCTAGC GCCCCTGCGC ACACGGTCGC CACAAATCCG CGCGTGTGTG 840 

CAGTAGTAAG GACCGCCGTA GTCTGAACAG ACGCAGGGGC GGTCGCAGAG GATGCGTCAC 900 

GCGCGTGCTG CATGCGAGAA AACGCACACA CTGCCACAAA CCCAAGCAGC ACAAAGAAAA 960 

TAATTCTAAC TGCACACCAC ACGCGGCGTG CCGGTTCCCC CACATGCCCG CGCGCTGTGC 1020 

CTTCCGCCTG CACTGAGTGC TCCGCGGcAA CACTGGCGGC ACGCACCCGA TTGCGCAAAG 1080 

ACGGCACACT TGCTAACAGC ACCCCGGCCA AAAAGGCATT CGTAAGATGA GGAAATGCTT 1140 

CGTACAACGC ACGCATAAAG CGTGGgCACA CCCCTATCCC CAmCACTATT CCCCCCGCAA 1200 

gGGCAAGCAA GCGTCGCCAC TGACGGCACA GATGCGCTCT ATCCAATGCT ATTGCTGCAA 1260 

TCAGTAaTTC CCAcGTGCCA CAGAGCAGCG CAACCGTACC CCCCGAAAGA CCCGGTACCA 1320 

CGTTTGCCGC TCCTATTAAC ATTCCTATCC ACACGTGCAT GACCGGTGAA CTCATGCTCC 1380 

GCCTCCCCTG CTGGTCTTCT TTCTGCGAAA AAACGCACCC TCCCCTTCAT CCGGGCACTT 1440 

CTCTCAGTTC AACCGCGTCC ATCGCATGCG CAACACACCC TCACCATACA AAAGGAGTCT 1500 

GAAACTCTAC ATGCTCAGCG ACAATCTTTA C 1531 



(2) INFORMATION FOR SEQ ID NO: 110: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 98 base pairs 
<B) TYPE: nucleic acid 

( C ) STRANDEDNES S : doubl e 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 110: 

TCTGCCCGGG TGTGGTGAAG CCGCnCAGTT GCCGGCGCGT TACnCGGATG GGATGCCAGT 60 

CCAGTGAAAT GCCGCGTGCa GAAGCgTTGG TGCAAACTGC CcTTTGCcAG AAaGCTGCcG 120 

TATCTAGAaT CcAGTTACAC TTTGAwCcGG GTTGTCCcTT TCTCTGAAaG GCTTCTACGG 180 

TTGCaGAGAC GGTACGTGCC aACGAACTTT TTGCGCCAGC AGTCTGATAT TCCCTTGCAC 240 

CTTGGAGTAA CAGAAGCAGG CCCGCTTGTT TCCGGTATTG TCAAAAGTAC ACTTGCATTT 300 

TCCCAATTAC TGTCACGCAA TATTGGTGCC ACGGTGCGGG TGAGTCTTTC AGATAGCATG 360 

GAGCATGAGG TGcTGGygCG CGAGAAATTC TTGCTGAATG CGGTAAACGG GCTGGTGGGG 420 

TTCGTTTAGT GTCATGTCCG CGCTGTGGCA GGATTGGTTT TGACGTACAC GCATTTGTGC 480 

GGAGGTGGCA AAAGGAACTG TTC AGTTTG A AAAAGGATAT CACGGTTGCG GTTATGGGCT 540 

GTGTAGTGAA TGGTCCTGGA GAAGGAAAGC ATGCGGATCT CGGTATCAGC GGTGCGGAGG 600 
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ATTCGGTGAT TTTTTTTAAG CGGGGAAAGA TAGTGCGTCG CATTCAGGTA CGTGATCTTT 660 

GCGCAGACGA GCGCACGCGC ATAATAGACG CAGCGTTTAA AGAGGAATTG TCAAGTTTAT 720 

GAATAACCTG aTCAAAGCAT ATGCGGc GGG TGTCATGAGT GCTGCGTTTC TTTTTGGGTC 780 

AGAGGGGCGG GTGCGCAgTG AATCCGATCG GGTGCGTGGG GAGGATCCGT GGCACCTGTT 840 

ACAGTGGGCA CAGGTTGTCT ATGAGCGAGA GGAATTgGCG ATACGTTGCG CTATGCcAgC 900 

GGGnCACGGG CGCTTCGGCG GGArCAAnTG GAGCACCAGT GCCgAaGTGC TACTGCGTGC 960 

ACGCACACGG GCTGAGTCAG CGGGGATACC CGAAACACTG TCTGATTTAT ATGCACTTTT 1020 

AAAAAGTCGA GGAGAGACAG ATGCCtGCGA AGTGCTTGAT GCTATTTTTC TCACTCATGC 1080 

GCCGCACGTT TTTCAAAACT CCGTTTCCAA AC TGCTC CAG TGGCTGAAGG ATTCAGCCGC 1140 

TTTTCCAGAA GCGGAGTTGC TCTTGGGAAA GGTATTCGAG GGTGAAGGAG AGTACGCCCA 1200 

GGCTTTGCAG CATTATCGAA ATGCGTGGGA TACGCGAGCG CAGCTTGTAG TTCCCGACGC 12 60 

TCGCTTTGAT ATTATCTACG CAATGGCGAA TGTGTCTCGT CTGCTCAGTC AGCAGGATGA 1320 

ACGGGAGAAG TACTTGCTCC TTGTGCTGAG CGAAGATCCT CTGTACAGTG CACGTGAGGT 1380 

GTGGGGCAAG ACGCTGCA 13 98 



(2) INFORMATION FOR SEQ ID NO: 111: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1900 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 111: 

AACCGATAGA CGAAGGGACC ACACGCGCTC CTCCCCTTTT TAAATAGAGA AAAGAATAGT 60 

CCCTACGGGA ATTGAACCCG TCTTCTGAGA TTGAAAATCT CATGTCCTAA CCGATAGACG 120 

AAGGGACCAC ACGCGCTTTC CCTAGGGAGG ACTCGAACCC CCACACACAG AACCAGAATC 180 

TGCGGTGCTA CCATTACACA ACCAGGGAGG GAGGCCCAGG CTAGTACGAC ACGTATTTTC 240 

TGTCAAGCAG ATAACGAACA CTCACTCAGA TCAAAAACTT ACCCGGTACC TCCACTTGAC 3 00 

CGACGCAGAC TGGGCAGCTG CCATCAGCAG CCCCACAGGC ACACGCGnCA TTCCGGGCAA 360 

AGACACACAA CTGGCACATT GCCGGATACA ATTGTGGCAG TCGCTCCTTC CAGTTCAGTA 420 

GCGCTATTGc AAAGCGCGCC TGGCTGcGAT ACAAGGAGGA GAACACTTAC TCCGTATGGT 480 

TATTTCAGGa TTACGCGCCA TCGAAGAGTT TCTGCGCGgC AG t CCTnTGC GCTAGAAGGG 540 
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TTACACGCGG GAGGAAAAAA CAGCAAGACG CAACAAAGCT GCGTGCGTGT GTCACGCTCT 
ATTATGCAGC GGAAAATGCG CGCATCAAAC GACTCCTTGG TATGGCCGCG GCACGGGGGA 
TACGGATCAA CACACAACGT GTGCTGTGCT TGATAAGTAG CGCGTAGTTT ACCCnCCTGC 
TGCGCGATCA CCGCGGTATC CTTGCTGTTC TAAGTTCACC GAGCGCAACG TCCGCAGGGT 
TCTCATACAA GAAAAAAACG ACtCCGCTGT GGACAGC TCG CAGGAACGAA CGTTGTTACA 
TGCACTGGCA ACGCACAcGC ACGCGCTTGT GCTCGTCTTA GACGCAATTA CTGATCCCCA 
CAACGTTGGG GCAATTGTAC GCATGCAGAC CAATTTTCTG TCGATGCAGT GCTCCTGCCG 
CACCATCATG GGGCAGGAGG TACAGAAACT ATCACGCGAG TGAGCGCAGG CGCCGTTGCA 
TGGGTACCGC TTGTGCGTGT ACGCAACCTA GTGCGCACTG CAGGTATCCT CAAGCGTTCA 
GGATTCTGGC TATACGGTGC TGATGTAGCA GGAGAAGCAA TAGGCGCCCG TACTTTTCCT 
CCTAAGACAG CGCTTGTGTT AGGCAACGAG GGGCACGGCG TTTCGCTTTG CTGCGCACGC 
ACTGCGACGC ACTCATCTCT ATCCCAACGC AGGGaATGTA GACAGTCTGA ACGTGTCGGT 
TGCCGCAGTA TTCTGTTATA CGAAATACGC CGGAGTCAGC AGTCTCCCTA CTCCGTACAA 
AGGCAAAACG AAATGAACGC TCAATGAAAA CACCCAGGGc ATCTTCGCAT CTTTAATTCA 
TATTGCATCT TTCACCCCAT TGCGTTACCt GAGGGAGTCT CTACGCaCGC GGTAcGGAgG 
GACCCCATGG CACATCTTCC TAAAGAGTAC GATTTTTCTA TAGAGTCATT GGGGGAAAGC 
AAAATTCCCT CTCCCATCTA CCTGTCTCAC ACCCTTGGCG ACTTCATTCC TAAcTACGTC 
AgTGACAATG AGTACATCAG CCATGAACTG AGTGCGCGTC TGGGGGAGAC GGTAGGGCCC 
TTTACTCATA AAAACTTGAT GGAGCGTGCG GGCCCGCGCC AGAAGATTTT CTTCAACCCG 
CATCACGTTC ATGCAGGTAT TGTCACCTGT GGAGGGCTCT GTCCCGGCCT CAACGATGTC 
ATTCGCGCCA TCGTCCGCTG CCTTTGGGGC CGCTATGGCG TTAAGCGCAT TAGTGGTATC 
CGCTTGGCTA TAAGGGCCTC TTGCCCGATT ACAACTTCGA TATCCTGCCG CTCAACCCTG 
AGGTCATCGA TAACTGCCAC AAAACAGTGG TTCGCTGCTA 
(2) INFORMATION FOR SEQ ID NO: 112: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13969 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 112: 
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TTCGATAGAC GGACAATATC GTGTCCCTAC CGCCTTTATT CTGCCAGAAA AGAATGGAGA 
ACGATGAAAA TTCTCACGGA TGAGCTGATG CGTTCTCTCC ATTGGTGTAA TTAATATAAC 
AGTCCCGCAT GTGGACGATT GATTTCCcAC GTGAGCmAAG GAAAAAGGAC GCATGATAGC 
GTCGGCTTCT TGCTTTTCCA TTACCGAAAG AT CT AC AG A T TTTCGTAGTA CGCGCGCGGG 
CGTACCCGTC TTCAGCCGTC CCATCTGAAA CCCCTTTTTT CTCAGCGCAG CTCCCAGCCC 
TTCAGCGGCA TGTTCCCCTA ATCGGCCTTC AGGTGCCTCA TATTCCCCAA TATACACACG 
CCCTTCCATA AAGGTCCCCG TAGTCAGTAC CACCGCACGT GCAGAGATGC GTCTGCCGCG 
TGCGGTAACT AyCGCATGCG CTGCCCCATA CGC T AC AT AT CCTGCATCAG TGGTATTGGA 
ACACACAACG TCTACCACTG TGTCCTGATA CAGGTGAAGA TGCTGCGTAC ATTCCAACGT 
ATACTTCACC TTCTGGGCAT ACAAAAACTT ATCTGCTTGG ATACGCGGCG CCTGCACTGC 
AGGnCCCCGG CTTTTGTTGA GCAGTCGATA CTGAATCATG CATGCATCCG CAAACTTTCC 
CATCTCTCCG CCGAGTGCAT CGATTTCTCT TACAATATTC CCCTTGGAAA TTCCTCCAAT 
GGAAGGATTA CATGAGAGCC TGCCGATACT ATCGATTGTC TGAGTGATGA GTAACGTGTG 
CTCCCCCATA CGGGCAGCGG CCAGCGCCGC TTnCGGCACC TGCGTGTCCA CCACCGACGA 
CGATAACGTC ATAGTCAGAA AATCTGAAAC CCATGGGCAG C GG ATT AT AG AAGAAAGGTG 
CAAAAGGCTT CAATCAGGAA GCACGACAGA ATTCAGGAAG CAACGCACAC ACAACATCTG 
AAGCCTGTTT CTGATAATAA AGGTACCCGT ACATCACCGC TGCGGATAGG ATCTTCCTCC 
CGTACTCCCG AGGTTCTGCA AGAGGCAGGG TTTCGAGGAA CAGATCATCG GGCAGACTTC 
CCCTCTGTTT TTTCCATTTG CGGACGCGCG ACGGTCCCGC ATTGTAGGAG AACAGGGCAC 
AGAACACGGA GCCATCAAAG CGACGAATGA GATCGGAAAA GAACAGACTG CCGAATCTAA 
CATTGATATC CGGGTCAGTT AAGTCATAGG TGTCGATATT GAGCTTACGA GCGATATCTG 
AAGCGGTTGG CCTCATCAAT TGAGCAAGAC CTACAGCGCC CGCCCGAGAG ATAACcTGCG 
GCTGAAACAG ACTTTCACTC CGGATGAGCG CAAACAGCAG ATACTCCTCA ATATGATACT 
TCCCTGcATA GCCTTGTATG ACGTCAAGCC AC GGACGC GG GTACGCAATC TTCAAGTGTT 
CTACAGAATA GCGCGCACCG TGAGAGCGTA TTGCGTACGA CTGAATGCGC ACGGCATCGG 
ACCATCGTGA TCTTTGTATG TGTGCGTTCG CAAAATGCTC TGCAAGGTGG AGAGGAATGT 
CAGGATAGAA CTGTACCATA GCTTGGTAGA ACATATCGTC CAAGTGATAG TCTACATAAC 
CTTGGAGAAT AGCACGCGAC TCATCGGGGG TAAGAAAAGG ATGCGGTGTT CTCTTACTCC 
GCACCTTATA CAAAGCCTCT TCCAGAGGAA TGCCCAATGC ACATGCAGCA AGAACACGAT 
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AGTAAAGGGA AGAATGCGCA gTTTCGAAGA TGGTACGGTA AGCGCGGCGC gcAcTCTCTT 
CTGAAAGCGT ACCTGAACGC GCAAGGACAT ACGTAACGCG CGCGGAnTTC GGGTAATGCT 
CTATGCGTCA CACCTCTTGC AGTGTAGCCA GACGCCCCCA ATCTTGCTGC GTAGTCAGCT 
GGACAATTGC ATAATCGACA AGGTCAGAAA ACCAAGAATC CGAACGCCAA CGTGGTGCAC 
TTTCAACAAG CACCTTAAAG AAAGAATCAA AGTCCAATGC ACGCAgTACG TCCAAGTAAT 
ACCACAGGGC ATTATCAGCA TCTTGCCGCC TTGTGGCCAC TTTCTCTGCC TTTTTGAAAA 
GGGGAAGCGC TTGCTTCTTG TGAGAAGCGG AC CGCGAGT A AAGACGCGCC GCATAAAAAT 
AGCAATAAAA ACGCAtCGTG CCTCTAATTC CGCATTGGAC AACGTTGAGC GCGATAGATG 
TGCAAGATGG TCAAGAAAAA CCTGCGCGGC TTTAACACTG TTTTCACTAC CGTAAAGCGC 
TGCTTTCCCA AAATCGGAGA GAACCGAATT TGTAGCGTAC CgCGTGTGCC ACGCAGAAGA 
GCGCAACAAC AATCGCACTC TCTCCCACCC CCGTTTGTAA TTCCTGTGAA AGACGTCCAC 
ACGAGCACCA TGTACTTCCT TAAAGATACG CGGCAGATCC GGCAGTGCCG CAATAAGCGC 
AGCAAATTGT GCACTCGTAT GGGGCGCAAG TGCCCTATCG TTGTACCAAG ACTCAACAAC 
CGTTCTCAGA CTGTAGTAGC GTTCGAGTTC AAAGAGCACC CTTGAACGCA AAAGGCGAAG 
TGTCTGTTGC TGCGTTTTGG TTCGCGCATC GCTGCCTGCA TTAAGGACAT GCAGCTGTTT 
TTCAATAAAG GCGAGACGCT GTAGTGGACT CCCTGTATTG TGTGCCTCAT GCGCACATAA 
CTGGCGGTAC GGTGCGGCCT GCGctGCGCC ACGTACGAAG TACTCGTGAG CTTCTTCGGA 
AAACTTGGCC CGCTTGAGGT GTAACCCAAT GAAGTAACTT GCACCTTCCC gCGCGGCGAC 
CTGCTCGAGA AACTCATCTG TTGGCTTTAG AAACGGAGTG TAGTTTTTGT CTCTCAAAAA 
CTGAGGGATA TCAACCTCCC TAGCGCCGCG CGcGCACACA GTGCCCTCCC ACGAGACAAG 
GACACACACC ACGCACCACA CGCGCAGGCA CAACCTGCCC TCCCGTAAGA GAGAGAAAAA 
GCCACCTCCC CACACGAGAT GCTTACCGTG ACGGAATTTC AATTGAGGTT CCTGCAACGA 
TATGATCAGG GTTTTTCAGT CGGTTAAACT CAGCAATTCT CATGTAGCGC CACGGAGTCT 
TGTAATAACT CCGCGCCAAA TCCCAGAGCG TATCCCCCCG TTTGACCGTA TAACGCACAA 
CCTTGACGGA TCCAGACCCA GACTCCTCTA CAGGTTCTGC CGCTGCACTC GTAGCCGTTT 
CAGCCCCGGG AGACGCAGGG GGgACGTCCT TTTCCTTGTG GGTTTCTGGA AGAGCAGTCT 
CAGATGGGAG ATCGACGACA CGTGCCTCAA CCGCAGTACG CTCAAGCTCA CGCGCTGCAA 
CATGCGCAGA AGACCTGGGC AGTTCAACTT TTTGTGACGG CACGACCGCA GGACGAGAGG 
CGCCGTGCAT GTACAGAACC CAACCTACCA AGACACCACC GAGCAGAATG AGAAATGCGC 
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ACACTGCCAA CAAAATCCGT CGTCTCCTCG TGAGGGTAGA 
CAGGGAAAGA AC TG AC ATGC TCGATCTGAG CAGCAGGCTG 
CTGAGGATCT CTCCAACACG GAGAAAGAGG TTTCCTGAAA 
TCGCACGCGC ACGGCACCGA CTCCCATCAC ACTCAACCTC 
CCGCCGCAGC AACGGACAAC GTGTCTATCG AAAGCGC AC C 
AGGTGACCGA CGCCTCTCCA TCGcCAGGAG AAACCCGTTT 
GAGACTGTTG GTCATTCGCG GTAACGAGCT CGAGCGTTGT 
TAAAGAGAGG GaAAAAAGAC CCGTCCGCGA GCTTGATACC 
CACCCCCGCC CGAGCATAGT CAGGCGAAAC CCTTTTGTCA 
GGGGGAGGCC GGCGCTCGCT GAGACTGCAC GCTCAGAGAC 
TGTAGGCTCC TCCTACCCCC CCCCCgTGCC TGTATTCCGA 
TATTGCTGTC CAAGAGCCCA CTCAGGCGGG CGATTACCTC 
CCCGCGTTCG TGCAACCAcT TCCGTGAGCA CTTCCACGTT 
CTGCGCACAC CTTTTGCGCA TCTCGTGTCA ACCGCTCAGC 
ATACCTCTGC TATCACTTTC CCCTGCTCAC GCGCCGTACA 
GCATACGCTC AACGGTACGC TCATTTTCTT CTCCCACCAC 
CCGCAAGGAT GGTGCTGAAA CCGTCCACCT TTCCTCTAAT 
CTCCTGCCGA CGCGTGCCCT GATTCTCCAA TCACTGCCTC 
CCGTTGCACC AGACTCAGCT GCTGTTCGCG CTGCCAGGGA 
CCGCAAAACC GCGTCCTGCG TCACCTGCAT GTGCAGCCTC 
GTAAATTCGT CCGTCCTGAA ATATCTACCA CCAGCGCATT 
ACCGACTCGT AATTTCTTCG ATCACCTCCA TCATGCGGCC 
TACTTGCCGT GGTAATTGCC TCAAAATCTG CGTGCACTGC 
TGTGCACCGT ACTCCCGAGA CGCTCAACTG CCTCAAGCGC 
GTGCTTCCAT TTGTTCATCC AACTCTCGCA AGGAGGAACC 
GCGTTTGAGA AAACCCTTCC CCCTGCACCG CTAACTCTTC 
GCGTCAAGTC TTGCACTACC TGCTCTGCAC GATCCATGGT 
TGTGCGAAAG cTCCTGcGCG GTGCGACGAA CTTCACCAAA 
ACGAGGAAGA TAAGTCCTGA GAAAGCCGCT GCGCAGTCCG 



PCT/^p>3041 

ACCATCCCCT CCCTCGTCTT 3540 

CTCATCCGGT GTCGCGTCTC 3600 

AGCACCAGAA CCTACATCCG 3 660 

GAGGCGAATA GTAGCTCCCC 3720 

CATAGGCTCA AGTTCTGCAG 3780 

AAAAAAGCTC AAAAGAGCAC 3840 

gCGCTGCGCA CCCTCTGCAG 3900 

GATCTCCTGA CTCATGAGCG 3960 

ATCTTCAGTC CATAGGGGGA 4020 

AGCATGCGGT TCACGCTACC 4080 

CTGAGGTTCG TCCGTTGGTA 4140 

CCGCGTATGC AACTCAACTG 4200 

ACCGATCATC GTCTCGACCT 4260 

ATGAGCTGCA CCTCGCGTGC 4320 

TAACTGCCCT GTCACCGTGC 4380 

CTGAACGGCA CCGGAAATAT 4440 

GTCTGTAAAA CTCTGGGCAA 4500 

TATTTCCTTT AACATTTTTC 4560 

GCGGATTTCC CCCGCTACCA 4 620 

AATCGCTGCG TTCATTGCAA 4680 

CGTCACCGCC AAACCACGCG 4740 

AACGCATTCC TTGCCGGTTT 4800 

AGAAAACTTA CCCCGCAAGT 4860 

TTCTCGTGCA CGTTCCCCTT 4920 

AAGCTCTTCA TTCACCACAC 4980 

GCGTACAGCA CCCGTGTAGC 5040 

CTCGGTCAGA TTCTGTTCAT 5100 

CAACCGCTCC AAACGCTGCG 5160 

CCCATCCAAA ACAACCTGCC 5220 
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ACTCGATGAG CGCaGTTTGC GTAACCAAAA ATCCAACAAA CCCAAAAATC AACAGGGGAG 
CTCCTGACAG TACCCCTACA GCCAGGAGCA CATCGTGCAG CGCTGTGACG CCTAACACCA 
GCACCCCCAT ACTCAAAGGc ACCGCGCCAC GCTTTTTGCG GTACAAGACT TTGCACATCA 
CCCAGAGCAC AATACCCAGC AAAAGCAATA CGAACAGTTG CTGTAAGGGA AGTAATCGGG 
CAAAAGAGGC AGGGGGAAGC AAAAGGATAA TCACCGCGTA CGCAAGCCCC TCTGCACCAA 
ACGCCACCAC AAAACTCTGA TTCACAAGCC CAGGGTACAA CGTAGAAAGA TAGTACAAAC 
AGGACACCCC CGAGAACGCC AGGGTCAGGT ATTCTAACCG CACCATCGGG TCCCACCCAA 
TCGTGATCAA GCGTGGCAGA AACGCATTCC CCGTCAGCAG CAAGCGAACC ACAATCAGCA 
ACGAAAAAAG CGAGCACGCA TACAGGCTCT TTTCGCTCAC TCCCAACACC CCTGCAGTAT 
CTGCCCGCTC ATTCTCCGCA TAAGACTTTC TGCTGCAGGA GGACAATCGC CGGAAACAAA 
ACATTGCCAG GTAATACGCG AATATCGTAA ACGCAAAACC AATTGTCATC GCCTCAAGCA 
CGTCCTTGCG CAGaCgCGCG TACGAACGCG CGAAACAGAC CCCAAGTGAA CTTCACCGAC 
AATGCCCGGT CTTAGACTGT GGTAATTGCT TACCTGAATG CACACGTCAA TTTCAGGTTC 
GTGCGTTGGC AACCACACTT CGGCAGGGTG GACGTAGGGG ACGGCATCTG CTCGGTTACG 
CGCAACGGTA CCAAGTTCTG TTAAGAGATG CCCGTTTGCG TAGATTCTTG CTGCGTAGTT 
AAGCGTATCG CAGGACAGCG CAAgCrACGG AGCGC GAGGG GGAAGCAAGA TTTTCAGCGT 
ATAGGTGGCA CATCCGTAGT GAGGATACGC CTGAATAGCG GGTATCTGAA CCGCGCCCCG 
CGgTTTGGTC CACAGGGAGG GAACGGTCAT AAACGCAGCA GGCTGCTCAG CTGAGTGAGT 
AAGTAACTCG TTCCAGTGAA ACCCCCACGT TCCCGAGAGA GAAAGCAGTG CATCACCAGA 
GGAAAAATCC CACTGACGGA GATCGAGCAC TCCGTTTTCT GCCGAAGaGG CGCCGGGAGC 
GCAGTGTGTG AAACCGCAGC AGAAAGGGCA AAAGCAAAGA CATACGAGTA CGTAATGGCA 
CGATATTTCA CGCCGCGGAC TATACGCGAC ACATCCAAAA TGATCTACGG ACTGCACTTG 
CTGTTCACGC AATCACACGG GCATGCCCAA ATGAGGAATT TTCATCGCCC GCATCAGGGA 
TAGTGCGCAC GTTTTTGGAA GGGTTCTCCC GCAGTGGGGA CTGCAACGCT TTTTCATCCA 
AAACAAAATA AAACTGTTGA CACGGTTTAT TCTCCTGCCT CACACTGGCT GCAGTCCGCG 
CCATTTTAAG TCTCTCCGTA AGGAGCGTCT CATGCACACG CAAAGCCTCA GCCCCAGGCA 
GTTCATGATG AAAATACTCA ACGGGTCTTC TGCCGGGATC GTCATCGGTC TTGTCCCCCC 
CGCTATCGCG GGGGAGTTGT TCAGAGCGCT TGCTCCGCTT TCGCCGCTGT TCGCCGCGCT 
CTACCATGTG GTGCTGCCCA TACAGTTCAG TGTACCGGCT CTCATCGGTA CCCTTGTTGG 
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ACTTCAGTTT CACTGCTCCG CGCCCGAAGT GGCTACCCTC GCCTTTGTTT CTGTTATTGC 
CTCAGGAAAT GTCACGCTTC AAAATGGCGC CTGGTTGATC ACCGGTATCG GGGACGTCAT 
CAATGTTATG CTCATATCTG CACTTGCAAT CATACTCGTC CGTGCTCTGC GGGGGAAACT 
TGGTTCGCTG ACCATCATCG CGTTGCCCGT TATCGTAGCT GTTGTCGCAG GGGGTGTCGG 
CTCCTTTTCC CTGcCCTACG TAAAAATGAT TACGCTTTTC GTCGGCAGAG TTATCGCCAC 
GTTCATCGCG CTCCAGCCAT TACTCATGAG TATCCTGCTG TCCATGTCTT TCTCGCTCAT 
CATCATCTCC CCTGTGTCTT CCGTCGCGGT AGGAATCGCC GTGGGGCTCA CCGGTCTGGC 
AAGTGGAGCA GCAAACATGG GCGTCTCCTC CTGCGCCATG ACCCTCATTG TGGGAACCAT 
GCGCGTCAAC AAGATCGGTG TTCCGTTGGC GATGTTCGCA GGAGCGATGA AAATGCTCAT 
GCCAAATTGG ATCCGGTACC CGATTCTCAA TATTCCGCTC C TGCTCAATG GCCTCGTTTG 
CGGCGTGCTC GCGTGGCTTT TCAATCTGCA GGGTACTCcT GCAAGCGCAg GCTTCGGTTT 
TATTGGACTT GTTGGACCGA TCAACGCCTA CAGGCTTATG GCGTACACTC CTATGGTGCG 
CGCGGGTATT CTTTTCCTCG TGTATTTCGT TCTTTCCTTC CTTGCTGCGT ATCTTATCGA 
CTTTATTCTC GTTGACCGCC TCAAACTTTA CCGGAGAGAA CTCTTTATCC CCGAACAAGG 
GTAGATATCC TATATGTTAT GTGTTTCCGC CCAGGTCCTG CGTGAGATAC GTGCAGAACG 
TGGGTAAGGA ATGTTGTTTG CCTTACCAAG GAGGTGCGAA ATGAGGTGTG TTGTCTTTAA 
TCTTCGAGAA GAAGAAGCCC CTTACGTGGA GAAGTGGAAG CAGTCCCATC CAGGGGTAGT 
CGTGGACACT TACGAGGAAC CGTTGACCGC AAAGAACAAG GAGTTGCTTA AGGGGTATGA 
AGGGCTCGTG GTTATGCAGT TTCTCGCTAT GGAAGACGAG GTGTATGACT ACATGGGTGC 
GTGCAAACTA AAAGTCCTTT CCACACGTAC CGCAGGCTTT GATATGTATA ATGCAACTTT 
GCTGAAAAAG CACGGCATCC GGCTGACGAA CGTACCGTCC TATTCACCGA ATGCTATCGG 
GGAATATGCA CTCGCCGCCG CGTTGCAGct GACGCGACAT GCGCGCGAGA TTGAAACTTT 
TGTAAGGAAG CGTGATTTTC GCTGGCAAAA ACCAATTCTC TCGAAGGAGC TCCGCTGCTC 
ACGCGTAGGT ATCTTGGGAA CGGGCAGGAT TGGACAGGCA GCAGCAAGGC TCTTCAAAGG 
GGTTGGTGCT CAGGTAGTTG GTTTTGATCC GTACCCGAAC GATGCCGCAA AGGAATGGTT 
AACCTACGTG AGTATGGACG AGCTGCTGTC CACTAGCGAC GTGATCAGCT TGCACATGCC 
TGCGACAAAG GACAGTCATC ACCTGATCAA TGCGAAAACA ATCGCGCAGA TGAAAGATGG 
CGTGTACCTG GTGAACACGG CACGCGGAGC AGTGATCGAC AGTCAGGCGC TCTTAGACAG 
CTTGGACAAA GGCAAGATTG CAGGTGCTGC ACTGGATGCG TACGAGTTTG AGGGTCCGTA 
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TATTCCTAAA GACAACGGGA ACAACCCTAT TACCGATACG GTCTATGCTC 
ACATGAGCGT ATCATCTATA CCCCTCATAT CGCCTTCTAC ACAGAAACAG 
CATGGTATTC AATTCGCTTG ACGCCTGCAC CACGGTGCTG CGTGGGGAGC 
TGAAATCAAG CTGTAACTGA CGcCAGGTGT CCCTGGTCCC GTGTGAGTCT 
TCGGTCAGTC TGGAGTCGCC AGCTCAGGGT GGGTTGTGGG ctCCGCGGGA 
CCGGTTACGT GCGGGCGCGG CCCACCTGTG TGAGCGCGAT AACCAACATC 
CCGAGACCAG TGCGGTGAAT CCCCCTGGGG CCACGTTCAA GTAATACGAG 
CCAGCGCCGT GTCCAGCATA CTAAATAGAA ACGCCGCCAC CAACGTAAGC 
CCCGCAGCTG TAG CnTGTCG CAACCGGTAC GGTCATGAGC GAGCTCAGCA 
GGTAATCTTT ATAGAAGCTG CTATAGTCGC TGAAATTACC ACCGACGCGA 
CCCGTCTGCT GCGACGCCAC AGATACGCGC GGTCTCTTCA TCAAATGCCA 
CTGATGGTAG CGCAACGCTA GCGTACCTAC GCAGAACACG CTGAGTGCGA 
CAAATCGCGT G TAG AAACAA CCAGTATGCT GCCAAACAGA TAGCTGTCTA 
GATAAGCCCA GAGCTCAACA GCGTGACAGC AATACCCACA CTCAGGGAGA 
AAGAATCAGG TCATGATGGT TTTTGAAAAA GGCGCGCAAA AACTCTATCA 
CAAGGCAGTG AAAAAAAAGG ATCCCCATCC TGGATGGATG CCGCACGAAA 
TACTCCTGCA AGTGAACCGT GCGCAAGTGC ATCTCCCATG AGCGCGTAAC 
TAAGTGCATC CCCACAAGAG GACACAACAA GGCTATGAGA AAAGAAGCAA 
GCGCATAAAT GCGTACTGCA ACATCACCGA CTCCGACACT GCGCACAGGC 
TTTTTCTGCA TATCCAAAAA CTCACTGACG TACTGCTGAG GATTACACAA 
CCTTCGCTGA GATGAAAAAT TTGCGTAGAG TTTGTAATCG CTGCATCAAG 
ACCGATATAA CCGTTACGTT ACGTGATGTG TTCAATCCCT TCAGCAGAGC 
TTCTGTCCTC GAGAATCAAT ACCTGTTGAC AGCTCATCGA GCACCAGCAA 
CCGATCAGGC TCCGCGCAAT GTACACCTTC TGTAATTCTC C TCC AG AG AG 
AGCTTTTTTT TCGCACCCCG CATACCCACC TCCTCCAGCA CAGCATCGAC 
TGCGATATGC GCAGAAGTCT GCGATACGAG TTAAGCATTT CATATACCGT 
TAGAGCGTGT GCATCTTTGT CTGTGGAACA GAACCAACAC GCTGTACAAA 
GTACCGGTGC TCGGCTTAAG TAATTTGAGG ACAAGCTTCA CAAGCGTGCT 
CCATTTTCTC CTACAACGGA AAGGTACGCG CCTTTTGGTA TTGCAAGATC 
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AG TAT AAAGC GCGCGTCTGC GGTGTACCTG AAACTAACGT TTTGTAAAAG CACCGCGAAG 
GGACTAGCCA TGATGCGCAT GGTTATACTC TATTTTTATG AATGCAACAC TACTGTCTCT 
GAGAAAAAAG AGACCACAGC TGCACAATGC AAATTGTCTC TCTACCATGG TTTTCGAAGA 
AATTCCCGTT CAACATCCTA AAAAGGAATT GCATATTTTG CAGAGAGTGT GCGAGTACGC 
GCCCGTTTGG GGTGGAAGCC CCTTCAAAGG AGCCATCATG CAACGCTGCT CAGTAGTTGC 
CGCCCTTGCG GGGGTGGTTT TTCTTGCACA GGCGTGTTCG CTATCAACAC CTTCTCGCAT 
AACCCACACG GATAAGCTGC CTGTTGTGGT GACATTTAAT GCTCTCAAAG AGTTAACACA 
GATGGTAGGT GGAGAAAAAA TTCATTTAGT GTCCATCGTT CCTGATGGGG TTGACTCTCA 
CGACTTTGAA CCAAAAGCAA AACACATGGC CTTCATTAGT GATGCCAAGG TCATCGTGTA 
TAATGGTCTT GGCATGGAAC CCTGGATACA CTCGGTACTC CATGCTGCAC GTAATAGCGG 
CAGTATACGC GTAGAAGcTG CGCAGGGCAT TGTTCCGCTG AAGGCTCACA CACGTGGGcA 
TACGGCGCAC CATGTACATG CACATGCATC GCACGGGTCT GCGTACGACC CTCACGTTTG 
GCTCAGCGTA TGTAACGCTC AAACGATGCT TCGTACCATC GGAAAGGCAC TGTGTAAGGC 
GGATCCGCAG CATACGCGCT TCTACAAAAG GAATGCCCGT AATGCGGCCG CACGGCTTGA 
GGCGTTGTAC AAGGAATACC GCTCCAAGTT TGCAGCCTTA TCTCATCGAT ATTTTGTGAC 
CACGCATGCG GCGTTTGGTT ACTTGTGCAG GGATTTTGAC CTCCAGCAAA AGAGTATAAA 
GGACGTCTTT AACACAGAAG AACCTTCCAT CAAGAGACTC GTAGAGCTCG TCGAATTTAG 
CAAAAAACAC TCAGTGCGGA CCATTTTTAG TGAACGTGGT CCTAGTGAAA AAGTCGCTCG 
CGTTCTTGCG CAAGAGATTG GTGCTTCAGT TGAAACCATC TACACTATGG AAAAAAACGA 
GGAGAACCTT TCGTACTACG AAAGGATGAA ACACAACATT AACAGGATTT ATCGTGCCTG 
TTCAAAACAG GTGACACCCT CGCAATAACA ACCGCTTTGC ACATTATGCG TTTTTCTGTA 
CACTCACCGC CATGTACTCT TGCTTAAGGA GGCTTTTTGG CATACGGGGC ACGGGGACTC 
TGTGTGCCAT GTCCGTTTTT TGTCTACTTC TTTCCTTTGG AAGGCGCTGT GTGGCGGCGG 
ATAATTTCCT TTCTTTCCTT GTGTGGAATC TGGTTCTTGC CTTCATCCCC TGGCTCATCT 
CGGCTATCTT GCACGTGCnc GnCTTCGCTG TCCGCAGTGT ACAGCTGTTC CTTATGCTGC 
TCTGGCTATT GTTTTTCCCC AACGCTCCGT ACATC CTT AC CGATATTATC CACTTGGGAA 
AGGGTAAGTC ATTTTTGCTT TACTATGACC TTATTATTTT ACTCGCCTAT AGTTTCACTG 
GTTTGTTCTA CGCGTTTGTC AGCCTTCACC TTATTGAAAG CATATTAGCC CGTGATTTTC 
ATATCAAAAG GCCATTCATA ATTTCAGTAT TTGAATTGTA TCTCTGTGCA TTCGGTATAT 
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ATCTGGGGCG TTTCTTGCGC TGGAATTCCT GGGACATTGT CCTACATGGA 
TTTCTGATAT TGG T ATCCGC GTCATCAGGC CAGTGTTCTA TGTTGACACC 
TGTTTTTTTT CGGCACCATG CTCGTTCTTT GCTATCAAAG CTATCGATCA 
ACACAAGAAA TGACAAATGA ATATCGTTCT CTTTGAACAG GAAGAGGTAG 
CGCTGTACTT TCTTTCAGGG ATAGTCGATT TTGCCATATC AAGCGTGTGC 
TGCGGGAGCC TGCTTCAAAG CAGGGATTAT TAATGGGGTG AAAGGTTCTG 
CCTAGCCACA GAAAAGTATC TCGTAGCCGT TTTTGAAAAA CTGGAATACG 
CCTTTTCCCC CTTCATCTTG TCATAGGGTT CCCTCGTCCC ATTCAGCTCA 
ACGCGACGTG TCCAGCCTCG GGATCTCCTC TATCCATCTT GTAGGGACGG 
GCGATCTTAC CTAGAC TCAG GACTTGCTCA CATGGAAAAA ATGCACACGT 
TGGCCTAGAA CAGGCAGGAG GCACGAAACT TCCCCTCATT ACTGTTTCGG 
CACCTTTTGC TCACAACACA CCCACATACT CGGCGACAGC ACACACCAAA 
TGATACTAAG AACACCCTAA CCGATCTAGG AAGCGCCcGC TGCGCGGGGA 
ATTGCAATAG GGAGTGAGCG TGGATGGACC GAATCTGAAC GTTTACTTTT 
GGATTTAGAG CAGTAGACAT GGGAAGACGG ACCTTGCGCA CAGAGACCCG 
CsCGTGCGCC GTTGTACTCG CCAACGCGCA CGCGTGGAAA AGAAAAATCC 
CAAGAGATCT TCGCCCATAA GTCGAAAGAA TCCCTAGATC CGGATCACAC 
AACCAGAAAA GCCCGAGACA GGCCTGAAAC AAGGAAACAC AACCAAAAGA 
ACGCGTCAAC CGAAAGCCAT TGCTCCCGAC CGAGGCGCTA AGTTTCAACC 
AACGCCGACC CCTTGAGGAG ACCTCCCAAA AAACCGCGGA AAGAAATCCA 
ACCGGCGTTG TATACACGGA CCGGAAGCGG CGCCAGGGGA ATTACCCGCA 
GCCGAGATCC ACGGGAGACG CTCATCTCAA GAGATCATGC CCTTACGTAT 
CCGGCGCTAA CTAGATCAAT ACATACCTCC CATACCTCCC ATGTCAGGAG 
AGAGGAACTC TTTTCGGGAA TTGCAGCAAT TGCACATTCG GTAGTCAACA 
AAC CGAAGCC GCGTTCTGTA GTGCTGAACG TGTAACcTTC GCCGGATCGA 
CTTAATCATA TCAACCCATT CCATCTTGGA TGCATCAAAA CCGATGCCAC 
TGCCTTCTCT GCCACAAcTG CGCCATCAAT ACCCGCGTTC TCTGAAATCT 
CTCCTCGAGA GCACGACGCA CAATCTTAAA ACCAACCGCC TCATCTGGAG 
CAAATCAGCT TTCTCGAGCG CCGCCGCAGC CTGAATAAGC GCTAAACCAC 
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(2) INFORMATION FOR SEQ ID NO: 113: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3357 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 




13969 



AATACCTTCC TCTATTGCCG CACGTGTCGC ATTTAAGGCA TCTTnCATA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 113: 

CACAGCACCC TACTGCGCCT TCAGGTACGC GACGCACACA TCGCAAGAGA AATTGCCGCC 60 

TCAAAAAGTC TTGCATCGCA CCTTTCCCCC ACAACCGCTT GCCTGCAGGA CGGTCCGCTG 120 

CCCACACCGG aCACTGCGCG CATTGCCCAG CCGGATACGA CACCTCAGGT GCAGGGTGCA 180 

CAGACTGGAT GCAGTTACGA CTTTCTATTG CCACGCCTCC ACCGAGGCAC CGTTAAACAA 240 

TTACTCTTGC GCCACGGATG GCCGGTGCAT GATGAAGTGC CTCTCCGCGA GGGAACTCCC 300 

CTATCCTTGC GCTTACGCAT CTCTCCGGCT TCGTGTCCTC CTCCTTCTAC TGCGTATCCG 360 

TGTTGTCACA CGCCAGGGAC ACCCTCGTTT GTGCCACGTG ACTACCAGTG GGAAGCAGCC 420 

GATGCCTTCG TCGGGAATCG CACACAAGGG AGCGGATTTG GTGTGGTAGT TTTGCCGTGT 480 

GGCGCGGGAA AAACGGTTGT CGGTTT AC TC GTTATGGGTC TGTTGCAAAC GGATACCCTG 540 

ATTCTGACTC CTAATAGCGC AGCTGCACAG CAATGGAAAC kTGAATTgTG TGAAAAAACC 600 

GACyTGGACG GGACATCCAT CGG TATCTAT TC AGG AG A t G CAgGAAATCA GACCAGTGAC 660 

TATCGCAACt ACCAGATAcT CACCTGGCGT GCGCATGCAG ACGCTCCcTT TTCCCATTTC 720 

CGTCTCTTTA TGGAACGCAG TTGGGGTTTG ATTATTTACG ATGAGGTGCA CTTGCTCCCT 780 

GCACCGCTTT TCCGTATCAC CGCAGAACTT CAGGTGGTAC GACGCTTGGG ATTAACTGCA 840 

ACGCTCGTGC GAGAAGATGG CTGTGCGCAG GATGTGTTCA GCCTCGTAGG ACCGAAGCGG 900 

TATGACGTGC CGTGGAAGGA TTTAGAAGCA CGCGGCTGGA TCGCACGGGT GCGGTGCGTA 960 

GAAGTTCGGG TAACGATGGA CCGGTCACTC C AG TAC C AG T ACATGACAGC TCCTGTGCGC 1020 

CTGCGACATC GCCTTGCCAG CGAGAACGAA GCAAAAGTAG CGGTGGTACA GCGTCTATTG 1080 

CGCGCACATG CAGGTGCGCC TACACTGATT ATTGGGCAAT ACGTGCAGCA GTTATTACAT 1140 

CTCGCACACG TACTGCAGGT GCCACTGGTG AGCGGAAGAC AAACTTATGC GGCGCGTGAA 1200 

GCCATCTATC AGCGTTTTCG CGAGGGCACG CTCCAGGTGC TCGTTGTATC AAAGGTGGcA 12 60 

AATTGTGCGC TTGATCTTCC TGACGCGTCG gTTGCAaTTC AAGTTTCCGG GaCATTtGGC 1320 
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AGCCGTCAGg AGGAGGCGCA ACGCcTCGGA CGCCTCTTAC GGCCAAAGAT ATGCGACGCC 
CATTTTTACT CGTTAGTTAC AGAACAAACG GTGGAAGAAG ACTGTGCAcT GCGTCGCCAG 
CGGTTTTTGG TAGAGnCAGG GTTACACGTA CGAAACCcTT CGCGTAAGCG AAGTaCACGA 
ATAAAGGATA CTCCGTGCAG AGTCCTCCCT GTGTGTGTGA GGgGGGGGGG AGGAGGGGGT 
GACCGTGCGG TCTCCCTTGT TTTTTTGGTT CAAGACCGCT ACAGTACTCC ATGCTCGTAC 
GCACTGCACT CAGGCTCATC TTTGGCTCCC AGCACGAGCG CGATCTGAAA AATCTCCTGC 
CTCTTTTGAA TGCCGTCAAC GCCCAGGAGT CCTGGGTACT TCCTCTCCAG GAGTCTGAGT 
TCAAACAAAA AACAGCTGAG TTTAAGGCGC GTGCCGCTGC AGGAGAAGCG CTTGACGCTT 
TTTTACCTCA GGCATTTGCG CTTGCGCGCG nAGGCAGCTC GTCGTGTTTT AGGCGAGCGT 
CCCTATGACG TGCAGATCCT CGGTTCCCTC GTCCTCCACC ACGGCAAAAT CGTGGAAATG 
AAAACGGGCG AAGGCAAAAC GCTCATGAGC GTGGCAGCGG CGTATCTGAA CAGTCTTTCG 
GGGAGGGGTG TGCATATTGT CACGGTCAAC GACTATCTTG CTGAGCGCGA CGcggAnTGG 
gATGCGTCCA GTATATGATT ATTTAGGCGT TTCCGTCGGC GTCATCCTCT CTTCCATGGG 
CAGTCAGGAG CGGCGGTGTG CGTACGCGTG CGATATT AC c TACGGTACCA ACAATGAACT 
GGGCTTTGAT TATCTGCGCG ACAACATGCA ATTTTTAACG GAAGAAAAAA CGCAGCGTGA 
TTTTTACTTT GCCATTATTG ACGAGATTGA CTCCATTCTC ATCGACGAGG CGCGCACACC 
GCTTATTATC TCAGGGCCTg CAGAAAATGA TACCCAGCAT TACGCCGAGG TTGACAGACT 
CGTCGGGCAG TTACAGGAAG TGGAGCGAAA TCCTGCCACA GGTGACTACC CCAACGAAgT 
GGACGGAGAG GAGGTTCGCG GCGATTATAT CGTTGATGAA AAGAATCGCA AGGTTTCCTT 
CAGTGGTCCG GGGATGCTGC ACATTCAGGA wtGCTCACGC ACGCTGGGCT TATCCAAGGG 
AGTCTATTTG ATGAAGAGAA CTTCAAGTAT ATCCACTACT TTACGCAGGC aCTCCGTGCG 
CACTTACTTT ACCGCGCAGA CGTTGATTAC GTAtAAAAGA CGGACAAGTA CAGATCGTAG 
ACGAGTTTAC CGGTCGCATG TTGGAAGGTC GGCGGTATTC TGACGGATTA CATCAGGCAA 
TTGAGGCAAA AGAACACATC CGCATTGCGC AACGTAATCG CACTATGGCA ACTATCACGT 
TTCAGmACTT TTTTAGAATG TATAAAAAGC TTTCTGGAAT GACGGGAACT GCGGATACCG 
AGGCGTTGGA GCTCAATAAA ATTTATAAAC TTGAGGTGGT AGTTTTGCCc GACGAATCTT 
CCCGTAGCGC GGGTGGATGA GCATGACGTG GTATACCTGA GTGAAGAAGA AAAGTGGAGT 
GCCATTTGTG ATGAAATAAA GGAGGCACAC ACACGGGGAC AGCCGGTACT CGTGGGCACT 
ATTTCTATAG AAAAGTCCGA AAAACTCTCT GCTCTGCTGA GAACACGCGG TGTAAAACAC 
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GAAGTTCTCA ACGCTAAAAA TCACGCGCGC GAGGCACTGA TTATCGCCGA AGCGGGGGCG 
AAGGGTTCGG TGACCATCGC AACCAACATG GCCGGACGCG GCACGGATAT CAAGCTAGGG 
GGTAATCCTG AATTTCGTGC ACGACAGAGC GCAACTGCCA TAGCATCGAA GCACGGTTCC 
TCCTCTGTCA CTGTGCAGGA ACATATGCAA GCGTGCTATG AGGCGGAATA CACACGGTGG 
CGCGCAGATT ACGAAGAGGT TAAGCAGCTC GGTGGTTTGT ACGTCATTGG CACAGAG 
(2) INFORMATION FOR SEQ ID NO: 114: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1462 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi ) SEQUENCE DESCRIPTION: .SEQ ID NO: 114: 
TTGCCCCCAC GnTGAAAGCG CTCCTGGTnA TCGGCGGCGC ACATTCGGCA AATACCCAGC 
GTCTACTCCA CACCGCGCGC GAAACGTCGC TACCTACGTG GCTGGTAGAG CGTGTAGAAG 
ATATTCCCCC CGATATCTAT GCCTTCAGTG CGGTGGGCAT CAGTGCAGGG GCTTCCACCC 
CAGACTGTGT TATCGCTGCT GTGGAgCAgG ccTGCGCACG GGCGGCgCGC CTGTCGCTTC 
TCGGGTGTCT TCCTCTGCTC TGCCCAAGGT GAGTACCTGC AGGGCTGTTT GTGCGGCGGC 
TACTTCTTCC GTCGGTTCAG CGGGTGCATC CGGCGCGGTG TCGCCCGGTG CTGTCCGACC 
TTTTGCTGTA GGCTCCGTGC GGTGAAAACC GyTTCCCTTT GTCTGCGAAg TGCATGGCTG 
AACGTTGCGC TCGGCGGGGT GCTGGGTGTC CTTTCCGGTT GTCGTtCTTC GACGAAAGGG 
CGGATGTGCG CGTCGACCGT ACGGGAAAAG AATACCTCGA TGCACAGGTT GCGTGTGCAA 
AAGAGGAGCT GCGCGCGCGT CCTCTGCGGG CGCTTATGTG TGCAATTGCG CTCAAAAGAA 
ATGCCCCAGC ACATCAGAAG GTGGCTCAGC TGTATGCCCc AGGgcTTGCG CGCGTCAAAG 
AGGCGTTTCG CTATTCAGTG GAGAAACAGA AGTGGTCGGA GGCACTTGTG TTTTTTCGTT 
CCCTCTCGGC ACTTCGCATT CCGCTGAAGG ACTGGACGGA GCGATCGCTG CATCGTGCGC 
AAATTGAACA GTGGAAAAAG GAGGGTGCGC ACGTATTGGT TGCGGCGCAA GAGAAGCGCG 
CCGGAACTTC TgCTGCGCGG AGTCCGGCAG CCATGATAAA GGGGACGGTC ACCATTTTGG 
TAGATCGAGG AATTCGCGTA GAGCACGGAC GCGGGTTTGC AGATCGAGTT ATCGGGTCAG 
GTTTTTTCAT CGACAAGAGG GGCTATATCG TCACTAACTA CCACGTTATC AGAAGCGAGG 
TAGATCCTGC GTACGAAGGt ATTCGCGTGC GTACATCAAG CTCCCCTCAG ACAACACCGT 
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GAAAGTTCCG GTGCGCGTTG TCGGGTGGGA TGCGCTTGCA GATCTTGCAT TGCTAAAAAC 
AGAAATTACT CCTGAGGTGG TGTTTGGCTT AGGTTCCTCA AAGAATTTGG ACGTGGGGAG 
TAAAATCTAC GCGATAGGAT CGCCTGCTGG GCTTGAACGA ACGCTTACTT CTGGCATCGT 
GTCTGCGAAA AAGCGCaAAC TGCTTTCAGT CGGTGGGGGA GTGCTGCAGA TAGACGCATC 
CATTAATCGA GGGAACTCAG GCGGTCCAGT TATCGACGAG GAAGGGTGCG TTCAGGCAGT 
AGCGTTTGCA GGTGTGGAGC AGCATGCAGG GCTTAATTTT GCCATTCCTG TAGAATTGCT 
CAAGCAGGTG CTGCCAACTT GT 
(2) INFORMATION FOR SEQ ID NO: 115: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 4532 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 115: 
CTTTTTGATG ACGCACGAGC GCTTGTGCCC CCGCGGGAAT TTTACGGGGG ACAGGCTCAT 
GCGCGGCAGC AACAAGCGTA ACCTGATGTC CCCtCGCGCT CAAACGACGC GTAAGTTCAC 
GCCCATTCGT ACCGCATCCA ACAACAATAA CCCTCATGTC CGAAGnCCAT AGTAGCACGA 
AATTTTTTTG CATGGCCAGC GCGCAGAACA CGGCGCACAA CGCCTGCCAC TCATATCTTT 
TTCAAAAGTA CCACTACCTG TGCGGTAACC GCCGCACCAG ATCCAACAGG TCCGAGGCGT 
TCGGCAGTCT TTGCCTTAAC AAAAACACGT GTTACGTGcG TGTCCAGGGc CTGCGCAAg s 
GATGcGCGCA TCGcTTcCCG AAATGGGTGT AATGCAGGCT GCTCAAGACA GACAACAGCA 
TCGAGATTCA CCAGCCGcCA GCACGCTGCG CGCACCAGTT GcCAGGTATG GCGGAGCAAC 
GCGCAAGAAT GTGCGTCTTT CCATCGTCCG TCACAAGAGG GGAAAAACGT GCCAATATCC 
CCCAGGCCTG CTGCACCCAA AAGGGCGTCA ATGCTCGCAT GCGCAAGAAC GTCTGCATCC 
GAATGACCCT GCGCTCCCTT CTTACTGGGa ATATGTATCC CTGcAAGTAT CAGcGGTCTT 
CCTGcACACA GcGCGTGCAT ATCAGTCCCC AGTCCAACGC GCAGGCACCT TCTTCCGTAC 
ACGGcAGGAC TGATATCCCC CTCGTGAGAG CAGGCTCCGA TGCCCGCTGC TcCAGATCCT 
CCGGATAGGT AATCTTTACA TTACTGCGTT CACCGGCGCA GACATGTACG GTCCCTCCGT 
AGCGAGCGTA CAGCTCcTGt TCATCAGTAT AOTGTTCCCC ATCAGTAGCA GCGCGGTGAT 
GGGCAGCGCA CAGCGAsGCG TAgcAAAAAC CCTGAGGGGT TTGTGCTAAG CGCACTCGAC 
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TCCGTATAAG ATGC GTTTCG ATACTCCCAT CGGCAGCAAC ACCCTTGGGA GTATCCGTCG 
CCTCTATAAC CGGCACTGCC GCTCCATAGC GACAGGTAGC CTCAAGTACA GAATGAATAA 
GCGCAACACT CACAAAGGGA CGTGCGCCGT CGTGCACCAG GACCACATCG GGCGCATGCG 
T As C ATCGCG TCAAGCCCCG CGCGCACAGA CGCACTGCGT GTATGTGCAC CCGGCACGTA 
AAGAATGACT GGAC GCGTAC GAGACGGGAA CGCCGAAAGA CGCGAATCAC ACGCCACTTG 
ACTTTCTGCG TACGCAACTT CACCTGCAGG AACGGTAACA ACGACAAGGA AGAACGACCG 
CGCCTCAAGG GCACGCACGA GTATCTCAGA AAGGAGGCAG ACACCAGGCT GCCTGGAAGT 
TAACGGCAAG TACTCCTTTT TCTGCACACA CGCACCACCT CTGCGCATGC GTGCAGAGCT 
GCCTGCAGCG GTGACAAGCA ACGCGGCGCG GgTCACCCGG GcACATCCAC CGTACCCGAC 
ACAACTGGGT TTTCCTTTGC CTTTTCCAAA TAGCCATGGA TGAGCGTTTC AACCTcAGGG 
GGTCGCAGCC CCAGCGCAAA ACACATCTCA TCCTGAAAAA TACGGTATGC AGAATCATAG 
AGCCTTCGCT CCTGGATAGG CAGTTCCTTG ACTTTACTCC GGTGGTAGAG CGAGCGCACA 
ACGGCCGCAT TGTCCAAGAT ACCACCACTT TTAAAAAGGT TTAAATTGAC CTGATAACGC 
ATTTTCCAAT CAAGAGGACT AGGATCAAAA TCCTCAGACA GAAACCTCAA CGCGCGCTCT 
GCTTCCTTCC TTTTGACAAT GGTTCTAATA CCCAGTTCCT GTGCTTTATC CACCGGAATA 
AGCACCGTCA TATCTGACTC TTCCAAGTAA ATGACGTATA TAGCAGCGTC TCGTTCTTAA 
ATGTTTTTTC GCTTATTTCC TGcACCTGAC CGACGCCCTG TCCTGGATAC ACCACGTGAT 
CGTGGGGACG AAACGCACAC GCCTTACCCA TGGGGCCAGC GTACACAAAC ACGGAAAAAA 
GTCAGTGGGA AGAGGAAGGG GAAAAACGAG GGAACTCCAC CACGCCCGAG TAGCCATAAC 
ACAAAGAACG TGTAGACTGG CGCACCCTTT TGTACTACTA TGCGCGCCAT GGCTTGCGTG 
CGCCGAGTGC GAAATTTCTG TATTGTCGCG CACATTGACC ACGGTAAATC CACCCTTGCT 
GaCCGACTCA TCGAAAGGAC GCGCGCGGTA GAAGAGCGTC TGCAGCACGC GCAGATGACC 
GACAACATGG AACTCGAGCG AGAACGAGGT ATAACTATTA AAAGCCACGC CGTGTGTATT 
CCCTACACGG ATGCACACGG CACCGAGTAT GTGTTGAACT TTGTAGACAC GCCGGGACAC 
GCGGATTTTG CATACGAGGT GTCGCGCGCA ATTGCTGCCT GTGAGGGAGC GCTCCTGGTG 
GTAGATGCAA CGCa GGGAGT TGAGTCGCAG ACGATCTCAA ATCTCTACTT AGTTTTAGAG 
CACAATTTGG AAATTATCCC TGTTATCAAT AAGATcGnAC yCTAcGGcAG ACGTGnCCGC 
GTGTGCTCCA ACAGGTAGAG CACGACCTGG GCTTGGATCC CGCCTCTAGT GTGTTGATTT 
CTGCAAAAAC GGGAGAGAAT GTCGACGCGC TCTTTGATGC AATTATCACG CGTATTCCTC 
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CCCCGCAGGG GAGTGGTACG GCCGCGCTCC AAGCGTTAGT ATTTGACTGT CACTATGACC 27 60 

AGTACCGCGG GGTAGTTGTC CaCATTCGTG TTTTCGAGGG ACAAGTCACA AGTGGCATGG 2820 

TTATTCGTTT CATGAGCAAC GGGGCAGAGT ACCGTGTAGA AGAGACGGGT GTCTTTGTAT 2880 

TCAACCTTAT TGCACGTGAA GCGCTGTGTG CAGGAGATGT CGGTTACCTG AGTGCAAATG 2940 

TAAAAACGGT TTCAGATGTA CAGGTGGGGG ATACCATCAC AGACGCGTCC TGCCCATGTG 3000 

ACACGCCGCG TGCTGGATTT AGACGGGTAA AGCCGGTGGT CTTTTCCTCG GTGTATCCGG 3060 

TGGACACTGA TGAGTGTGAG CAACTGCGCG AAgcATTGGA GCGACTTGCC CTCAACGACG 3120 

CarTATTTCC TGGGAACGAG ACTCATCCTT AGCGCTGGGG cACGGATTTC GCTGTGGTTT 3180 

TCTAGGACTG CTTCATCTTG AAGTAGTGCA GCAGCGTTTA GAGCGAGAGT TCAACCAGAC 3240 

AGTCATTTTT ACTGCGCCTC AGGTGCAATA CTATGTGTTT CTAAAAACGG GACAGCGCAT 3300 

AGTGTGTGAC AACCCAGCCC ATTATCCTTT GGAGCAGGAG ATTGCACAGG TGCATGAACC 3360 

CTACATCCGT GCAACTATCA TTACGCCGAC AGAGGTGCTC GGTGCTGTCA TGACGCTCTG 3420 

TATTGAAAAG CGCGCGTACC AAACAGCGGT GAACTATTTA GATCAGAAGC GGGTGGAACT 3480 

GGTATACGAG ATGCCCCTTG CGGAAATTCT CTTTGGGTTT TACGATAGGC TCAAGAGTAT 3540 

TAGCCACGGC TATGCGTCTT TTGACTATGA GCTTATAGAG TCGAAGCTCA CAGATCTGGT 3600 

GAAAGTTGAC ATCCTTATTA ATGGGAAGCC GGTAGACGCG CTTGCGCAGT TGTGCTATCG 3660 

ACCGCATGCC CGCAGAAGGG CGCAGGCGGT GTGTGCTCGC CTGAAAGAGG AGATTTCCCG 3720 

TCAGCAGTTC AAGATTGCAA TCCAAGGCTC AATCGGCGGG CAGATTATCT CGCGCGAGAC 3780 

GGTTAGTCCG TTCCGCAAAG ATGTACTTGC TAAATGCTAC GGAGGTGACA TCACACGTAA 3840 

GCGAAAGTTG CTGGAGAAAC AGAAGGAAGG GAAAAAGCGA ATGAAGATGG TGGGGGATGT 3900 

GGAGATC C CG CAGACTGCCT TCCTGTCGGT GCTAAAAGAG GCTTCCGACG CCTAAGGGTT 3960 

TCAGCGCTGT TTTTTAGAGT CCTCTCCGTC TTGCAGGGGa TGTTGCAAAA GCGATGGTCC 4020 

GTCATGCTGC GGTGTAGACT TAGGTATCTG GATAAGTAGA CAGAACACAC ATTATACGCA 4080 

GCAAAAACAG AAAAAGAACA GGCGGGGAGG GCGACGCGCg CCCTCCGGGC CGCAcTAAaT 4140 

CTTAC CGATT AAa TCAATAC CAGGCTTCAA CGTCTTTGCT CCAGGCTTCC AACGAGCaGG 4200 

ACAAACtGAT CCCCATGCTT AGCCACAAAC TGTGCTGACT GAACCTTGCG CAAAAGCTCA 42 60 

TCCGCATCGC GCCCAATACC CATGTCGTGT ACCTCGAAAG CTTTCACAAG GsCTTCAGGA 4320 

TCGACCACGA ACGTACCCCG CAGCGCATGC CAAGTGTCTG GCAACAACAC TCCAAAGrAA 4380 

CCCgCAAGCT TTyCCGCCTT GTCAGAAATC ATCTCGTAGG GcAGATTCTT TATCGTGTCT 4440 
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GTCGCATCCG CCCATGCCTT GTGCACGTAC TCACTGTCCG TAGAAACCGA ATATACCTTA 
CAACCAATAA CTATAGGAAA CAAACGGGGA AA 
(2) INFORMATION FOR SEQ ID NO : 116: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6923 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 116: 
GTGGTTAGAT TCCCTTTTGG GGATGAGTTG GATGGTGCTG ACTC TGTGTG TCTGTGCGCT 
GTTGTTTTGC CTGAGGAGGA AGTACGTACA TCTCTTTTTT CCTCGTGGGG TTTCGGTGCA 
CACGCCCCCT GCGTCTTCGG ACGTGCGGAG TGTGTTGCCG GATATGCCAG TGAGAAGGAG 
GCGAGGAATC TTTGTCGTAC TCGAATGGGT TGACGCGCTC ACCCAGGCTG CGTGTTTCAT 
GCTTTTGGTG AATTTGTTCG CGTTCCAGTT GTACGTTATC CCGAGCGAAT CGATGGTCCC 
CAGCTTTATG GTCGGCGATA GACTCCTCGT GTTCAAGACC GCCTCAGGGC CTGTATTCCC 
GCTTTCTTCG TTTCGTTTGC CACGCTGGCG TACCTACAAG CGCGGAGACA TCGTCGTTTT 
TTCCAATCCT C ATT AC C C TG ACACTCCGCA GaTAAGCTCC GCGCCTTTTT AGCCCAATTA 
GTGTACATGC TCACCTTTAC GCGCAAGAAC ATTAATGTGG ATCCTGTCAC CGGTGCGCCG 
AAAGCTGATC CTCTCGTCAA ACGCATTGTT GCTCTGCCAG GGGAAAAAGT TATGCTCGTT 
GACGGTGTGC TCTATACGAA GACCAGGCAT GATGCGCACT TCAAGCCTGT CGCACAAGAC 
CGTACGTACG CCACGTGGGA TTTGAATGCG TTGCCCGCAC GCGATTTGGC GCGTGTTCAA 
CGGGTCATAT TTAATGCTGA GGAGCTCGCC GCCATCCATC TGGTAGAgCG CCTGCGCGCC 
CAGGTGGATT TTCGCGATTT AGCAGAGAAA ACGCGCGCGT TGGTTGCCCA AGCGCACGCG 
TaCGCGGGGg CGGCGTCACG CACCCGACAG GGCATTGGCG TGGCGCAACC GATAACGCAC 
ACATCTGACA TTCCTGCTTT ACCTCTGTTT GAAAAAGAAA TGCGCGGGGC GCGGGAGATC 
ACACAGCTCT TCGCCACCGT TGCAGACGTT GCCACGCATA TCCGCGACAC CTCCCAGGGG 
TTCGCnCAtT CGCTCACTTT GTGCAAAGCT GGATCCCATT TTGGGGGCAA GGAACGTATG 
GCTTGGACAC GGGACAGGAA GGTCCGTCCC tGCACCGCGC AGGCCTCTCG CTCTACCAGA 
TAAGATTTGC GCAGCTGAAC GCGTTGGTGA AGTACACGTT CGCCCAGCTA GTGGTAAAAG 
GCCTCCAGGT GACAGCACAC CGAACGTCGG AGGCTGGGCA GGACGAAACG CTCACTACAC 
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TTTTGCAGGA CGCGGCCCGG TACATCTTTT TCCTGGGTGC GGCGCGTGGA TTCAACATGG 
ACGAATTCCC CGCTGGCGCC GAGCAGTACC TTCCAGAACA CAACTACTTC ATGATGGGAG 
ACAACCGATT GAACTCTACT GATATGCGCC ACGCGTACAC CGAACACCTC GAGGCAATCG 
ACGCGCACGA CCCGTTCCCT ATTTTCTTTA GCTCCAATGT TGCGCCCAAG TACATTCCCG 
ATAGCCACAT CCTCGGTGTG GCGTCGTTCC GATTCTGGCC GCCCTCCCGC ATAGGCACCC 
CACAATAGGC TTAGGGGGAG CGGAGAGAAA GCTAAGAAAG GCGGAACCCG CCCAGGCGCG 
CAGCAGGcAG AAAGCCCGTA CCCTCAGACA sGTCCCGTTA CACAACAAGC GGGATAGGGA 
AGAGCCCTCG CTTACAAGGG CtCAGGAAAG ACATCCCCTT ATCGTGCGAA gcGCTcGCCG 
AATGCTTTAC AGCTTGCCTG GGCTTCTTCA CTGGGATCGT CGTACGCGAT CTCCCCCTTC 
CCCTCGAAGA CATCAGCGCC AgCCGCCTTA CAACGCTCGA CCCAGTTGAC CATCCATTCG 
CCGCCTTCCC CTTCTCCAGC CCACTCATAG GATCCGAAAA GCGCAACTTT TTTCCCCGAT 
AACCTTCCCT CAATAGAGGT AAAGAAGGGT TCAAACTCGC TTGACTCTAG CTCCTCAGAA 
CCAGctGCAG AGCAGCCAAA GGCGAAGsGG TCATAGGAAT CAAAAGTACC AACGTCGAAG 
TCCATGACGC TAAAAAGGTC AGCTTTTGCA CCACCGACAT TCAAACCCTC TACGATGCAG 
CGAGCCATCG TTTCAGTGTG CCCAGTGCCA CTCCAAAAAA TGACAGCAAC TTTTGCCACA 
AACTCCTCCT CGGGAACGTC ACGCAgTGGG TGcACTCGCA AAATAGTGCA GCCACGACAC 
GCGCACCCTG CCCGCGCAAG GGTAGGGGAA AGCTCTGTTG CTGTCAACCG CAGCCACAGC 
AGGATCCGgT GCCACCCTGG ACACCGGTAG ACTTGACGGG CCGACATTTT CCGGTACACT 
GGGGCCTGCG CGCCAACTTA GCTCACCTGG CAGAGCAGCA CCCTCGTAAC GTGCAGGTAC 
CCGGTTCGAG CCCGGGAGTT GGCTTTCTGT TTGGCGTA t T CCGCGCTGTG GGCCGGTAGG 
TGAGTCTTGG AAGAGGTGGG GGGGsGCGGg AACGGCGTGC TGTCCTGTTC CTACGCGTTT 
TTCCTCACTT CGGGTGGGGT GTTTTCCCTT TGAAGAACTG GGCAAACGGC TGkTATCGCG 
CGAAATCCTG TCCCGGCGCG GGGGATGTGC CCCGTGTCTT TGCGCGCTCA GGGGAGGTTT 
TCCCTTCAGG AGCCCGGGGA CGGGGTGCTC TCCGTGAAGG TGTCGCCGTG TGATCCGCAG 
gTGCACGCTG CTCCTGCCTG AGACTGAGGG AAATACGTCT CCTTGCCTCG TCAAGCCCGA 
TGATGGTGCA GGTGTACACG TCACCTACCT TCACTTTTTT GAGGGGGTTG GAAACAAAGT 
GGTCGCTCAT CTGCGACACG TGCAGAAGCG CCGTTTCCTT TATTCCAATG TCCACAAAGG 
CCCCAAAGTC CACCACGTTT TTTACCTTTC CCTGTACGGT TGCCCCCACT TTTAAATCTG 
CAAAGGATAT CAGACCTTGG CGCAGCACCG GTTTTGGATA ATCCTCGCGC GGGTCACGAT 



1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 



Printed from Mimosa 02/03/22 07:29:20 Page: 727 



WO 98/59034 PCT/^^W3041 

726 

TAGGTTTTTG CAGCTCTGTA ATGATATCTT CGACGGTTCG ATCACTGACT GCGCATTGCG 3060 

ACTGCACCTG TGCCTTTTGc GCTGCGCTCA CTGTACCGCC TGCGCGCAGT ATATCAAAAA 3120 

TTATCTTTCC CGTTGCATAG TTTTCTGGGT GCACCCACGA GTTGTCCAGC GGGTTTGTGC 3180 

TTTCGGGGAT TTTTAAAAAT CCTGCACATT GCTCAAAGGT TTTTTGTCCC ATACCACTGA 3240 

CTGTTTTCAG TTGTTCGCGG CTAGTGAATA TGCCGT AG t G GCACGATGGT GCACGATCCT 33 00 

TTTTGCCAAC GCGCTATTAA CGCCAGATAC GTGCTTTAAG AGAGATACGC TAGCCGTATT 3360 

GAGATTAAGT CCTACGCTAT TGACTACAGC ATGTACTACC GCGTGGAGCT CCTCAGATAG 3420 

CTTTTTTTGA TTAACATCGT GCTGATAGAG TCCCACCCCA ATGGATTTCG GATCAATTTT 3480 

TAC CAGCTCT GCTAGAGGGT CTTGCAGCCT GCGTCCAATG GAGATTGCAC CACGGATGGT 3540 

CAGATCTAAG TCAGGGAACT CCTCTCGCGC AATATC TCCT GCTGAGTATA CGGAAGCTCC 3600 

GTCTTCCTCT ACCACGGTGA ATGCAACGGC AGAGTGTGTT TCGCTAATTA TGGAGGCGAT 3660 

AAGCTCCTGC ACTGCATGGG AGCCGGTGCC GTTCCCAACG GCTACGAGCT GAATGCGGTA 3720 

gCGATCAAGC GCCTGCGTCA AAGCGGCGCG TGCATGGTCC GTGTTGTGCG GATATATGAC 37 80 

AAAGGAGCCG AGATATTGGC CCGTTTCATC CAGTGCCGCA CACTTAGTCC CTGTGCGGAT 3840 

GCCAGGGTCT ATGCCGAGCA CGCGCGTGCC CTTGACCGGC TGGGTCATGA GCAGATTCGT 3900 

AAGATTTTCA CTAAAAACGT TGATACCGTG TTGCTCTGCC GAAGCGGTAA GGTCTGCGCG 3960 

TATCTCCCGC AGGACGGCAG GACTGAGCAG GCGCACCACG CCATCTGTAA TGGCATCGCG 4020 

ATGATACCTG TTGTTGGGGT GCACCGCCTC TTGAACCTGC TCGACAGCGG CGTCTAAATC 4080 

GACGGTGATT TTTACGTCAA GGATTCCCTC ACGCTCCCCC CGATTGATGG CTAACACGCG 4140 

GTGCGCCTTG ATGTCGCGCA CTGCCTCTGC GTAATCCCAA TACATTTGkT AGACGGACGT 4200 

GtGgCAGCGT GCGCGTcCCC GATTCCGGTA GCCGTAACGA CGCCTGCAGA AAGGTAAAAG 4260 

GACTTCAGTG CGGCACGAtT GGCGTTGCAG TGTGCGGTCT CtCTGCGAGG ATATCGCAGG 4320 

CGCCTGCGAT GGCG TCTTGA GCGCTGGAGA CGGCACGATC AGAATCTGCa GCAGGAGCGA 4380 

cGAgCGCTGC GGCAGCGCGC TCGATTTCTG CCTGCGTGGC GCACTGCGTT TCTATCAAAC 4440 

GCGCAAgCGG CTCGAGTCCT TTTTCGATCG CCTGCATGCC GCGTGTCTTT TTCTTTTTTT 4500 

TGAACGGAGC CCAGAGGTCC TCGAGTGCTG CAAGGGTAGG AGCGCTCCTG AGGTGCTCGT 4560 

AGAGCGTgGG GGTGAGCATG CCTTCTTTGA AGACGGCGCG TATAATCTCG AGTCTGCGTG 4620 

TTTCGCGTGC AAGGTGGGTG TGGAAGAGGC GTTCGCAGTC GCGGATGAGC ACCTCATCGA 4680 

GGCAGTGATG CGCTTCCTTC CGGTAGCGCG CAATGAAAGG AACCGTGCAG CCTTCTTTGA 4740 



Printed from Mimosa 02/03/22 07:29:21 Page: 728 



WO 98/59034 




GGAGGGAACG CACGGCAGTA ACCTGCGCGG TGCGGATGTG CAGTTCGCgc GcTACGCGTT 4800 

CTGCGAGCTC GTCCTCTTGC ACGCTGAGTG CGTCCACAAA GTCCTGGTCT AAAGTCATGA 4860 

GGGGGAGTGT AACGCGTTTG GCTCTTTTTA GAGAAGCGCC GGCCTGCAAC CGGCTCCGGC 4920 

GCGGACCCTG GCGTGGCACC GGCCAGAGAA GGGCGAGTGG AGAATAGGGG AGTCGAACCC 4980 

CTGACCTCTT GATTGCGAAC CAAACGCTCT ACCAGCTGAG CTAATTCCCC AGGAC TGCTG 5040 

GCTCCAGCTA TACACCAAAT ATGCGCGTCC TGCAAGGGTG TTTcCTGCGG GTGTGATGCC 5100 

CCTCGTGCAC CCTGTTCCCT GGCGCACTGC GGCGCGGTGT AGCGCTCTAG GCGCGTCGGG 5160 

GGGTGTTGTA GAATAGGCCG CATGAGCTAT TCGTGGAAAG TGCGCGCGCT GTGc TGCGC A 5220 

GGACTGTGTG TAGGTGCGGG GCTTCGTGCC CAGGAGGGCA GCGGAATTCG CGTGCGCGGT 5280 

ATGCCGGAAC ACGCGCAGGT GACCGTAAAC GGATATCTGT GCGCAACACC AGAGGAAATG 5340 

GTGCTCACCC CTGGTGAGTG TGAGGTAACC GTCTGTGCCT TTGGATATAC CAAAAAGACG 5400 

CTCCAGGTAG TGGTTGAGGA AgGCTCGTTC ACGGTGGTGG ATGGCCGTCT GGATACGGCG 5460 

CGTTTGGAGC TCACGGATGT G AC TG C GC AG AGGGCGCACT TTAATCCGCG GGATCCGGCG 5520 

GgACTGAACA CGGAgTACGT CACgTTCCGG GTGACAAAAT CTGCAAAgTG TACGGTAACG 5580 

gTAAAGGATG CCGAAGGAAA GGTGGTGTGC GAGGAGCCGG TGGAGTTAGT TGAGCTGGGG 5640 

TTGAACGTGG GGGGAATATT CGGGGGCAGT AATAAGAACA GCGAGGATGT TAGCGTTAGC 5700 

GCAAAGGTAG CGTTCGAAGG GAACGTTACG AGCGACCCGG CTATGGGCCA GCTCTATGCC 5760 

TCAGCGCTGT GTTTGTACCG CATCGTGCAC AACAACGATA GCAGCGGCGC AAACAAGTGC 5820 

TTCATGCGGA AGGGTTTGAC GTTTGCGACC ACCTGTGCGT ACGGCATTAA GGGATTCACC 5880 

GTCGCGCTCT CCGGAGAACT GGGTGCCAGT TCAGAGACGG GGATAAAAAA GCCGGACTTC 5940 

TCAACCGATG TCGGCCTGTC GCTCAAGTAC CAAAACAAAA TATGCTCCAT TGCCACGTAC 6000 

AGCAAGTGCG GAACCACCAC GGGGAGCAAT AGTGACGGAG CGAACAGTGT GGCGGGTGTG 6060 

TCcGGTTATG CGTGCTGCCT GCAAGTCTCG TGATGGGCTT GGAGAACAAT ACGCTTCAAA 6120 

GGTAACTCTT ACGAGGGCTG GGnAnTACGC GCTTCCATTG GGTACGTTAT CAACACGAAG 6180 

CTGAGAGTCG GGCGACCATA GCGGGGCAGG GTACCAGCCT GCTGCGATCG CGCGGGCAAG 6240 

TGCCGCCGTC AnCGTGCGCG GTCTGCGGGT ACGTCATACC AAACCAGCGC GCGCTGCTGG 6300 

GGTAGCAGCG CACCGTTCCT TTTCCCTGTG CAATGAGGCT GTTTACCGCC GCGGGCAAAA 6360 

GGTATTCCCG CTCGTGCTGC GGCGCGcGcs CTtCTTGCAC GAATGTCTGC CAGCACGCTG 6420 

CGAGGTGTTC GAACACGCGC GGAtGAAGCC AAAAAAGTTC ATAGACGCTA CTTCCTGCCC 6480 
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GGTGAGTGTG CAGGGCGCGC GTGCCGCTGT GGCGGGcAGT CTTCCGCGCG CTGTGnGrTA 6540 

ATGACCCGTC TGcCGCCGTG CGCGTGCCAG CCGATGTGCG TGTGTTCGTG TATGGCACGC 6600 

ACCAGGCGTC CTGCGGCGGG GACTGCGGGG GTGGGGGGAG AGACTGCGGC GTCAcGTcCG 6660 

CAAAGGTGCA GATACCGCGC GAAACGCCAC CGGTTTCGCT GAGCgTGTGC ACAAGGGGgT 6720 

AGCCGACCAT GGCGTGGCGT GTCGAGTCCA GCCCCTGCGC GGCAAGGTGC GCGGCAAGcg 6780 

TTTTGTAcgC GTCGcnTCCG TAGTAGTCAT CAGCGTTGAT AACCGCAAAC GGTGCAGTCA 6840 

GCTGTGTGCG TGCGCAagCA AGCGCGTGGC CCGTnCCCCC ACGGCGTGCG CGCGACAGGA 6900 



TGCGnCAGAG CGGCGCCGTG TGC 
(2) INFORMATION FOR SEQ ID NO: 117: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6986 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



6923 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 117: 

ATACCTTACA ACCAATCTCC ACAAAAGACG GATATACGCG CGCGAGATCG TCAAGCTCGG 60 

TGGGGCACAC AAAGgTGAAG TCGGCCGGGT AAAaCmTAAA CACCGCCCAr cTACCCTTAA 120 

TGGATGCGTT AGAGACCTCC GTAAACTTCC CCCCGACATA CGCAGGamGc TTAAAGTCTA 180 

TAACCCTCTT GCCTATGAGA CTCTCCaTAG cAACCTCCCA AACGTTCGCA CTCTAGCGCA 240 

AACTATAGGA AACAAACGGG GAAAAGTAAA GTCTTTCACA CGTATCCCCC GGGTCACCAC 300 

ACCATCACGC AGTACGTTCA GAAACAAGGC GAACGCTGGC TTTTTCCTCT TGTGTAACTA 3 60 

CCTTCTTCGT AATGCATAGC TCCTTTTTCC CTTTCAGAGA CGGTGCCTCA AACATAGCAT 420 

CAAGCATTAA TCTTTCCACA ATAGAGCGCA AACCCCGCGC CCCCGTTTTT TGATCAATTG 480 

CCTGCTGAGC TATTGCGTCC AAAGCGTCCT CATCAAAGAC aAGACGCACG kCATCCAACG 540 

CGAATAACGC TTCAAACTGA CGGACAATAG CATTTCGCGG TCGTACCAAG ATATTGCGCA 600 

GATCCTCTTT AGAAAGAGCA TCCAAGGCGA CCGTCACCGG CAGACGGCCG ATAATCTCTG 6-60 

GGATTAATCC AAATTTCACC AAATCATCCG GAATGACGTC CTCGTGCATC AGTTGCAGAC 720 

CTCGCTCCTT TACCGTTTTT ACATCTGCTC CAAAGCCAAC CGGATTCTTA CACAcTCTCG 780 

TACcGACAAT ACCATCTAAC CCAACGAACG cACCACCACA GATGAACAAA ATGTTCGATG 840 

TATCCACCCT GAGCATGTCt TGGTTTGGAT GmTTGCGACC CCCtGCGGAG GCACcGATGC 900 
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TATCGTTCCC TCAaTTATTT TCAAAAGCGC cTGCTGAACC CtTCACCCGA CACATCACGC 
GTAATAGACA CGTTCTCGCT CTTACGCGAA ATCTTATCGA TTTCATCAAT GAAGATAATC 
CCCCGTTCTG CGAGGGCAAC ATCTCCGTTC GCATTCTGAA CGAGCTTTAA TAAGATATTC 
TCTACGTCCT CACCCACATA ACCGGCCTCG GTGAGCGTAG TAGCATCTGc t ATCGC AAAG 
GGGACCTTCA TTTTCTGAGA AAGTGTCTTA GCCAACAGCG TTTTGCCTGA ACCTGTCGGC 
CCAATAAGCA GCACGTTAGA TTTTTCAATC AATACCGAAT CAATATCCAA AGACCTACCT 
GCCACCCGTT TGTAGTGGTT GTACACCGCA ACCGATAGCA CCCGCTTGGC CAAATCCTGC 
CCAATAACGT ACTGATCAAG GTAAGCTTTC AACTCTAAAG GAGTTGGAAT CTCCTCTTTG 
GTCATGAGCG CAAGTGCCGA CGGCTTGCGA TCACGCAAGT ATTCTGCACA CCGCTCCACA 
CAATAATTGC AAATAGAAAC CCCATGACCG GTCACAATCC GGCGTCGCTC ATCTTCTTTT 
TTTCCACAGA AAGAGCAGCC CAATACCAGA TCCCCCTTAG ACCTGAGCAT GCTTTCTCCT 
CTTCATTACC GTATCTACGA TACCATACGA ACACGCCTGC TCCGCGGAAA GGAAGAAATC 
TCGCTCCATA TCCTCCCGCA CCTGCTCCTC TGACTGCCCA GTGTGCAAcG CGAAATACGC 
AATCGTCAGC GTCTTTAGGC GCAGGATCTC CTGCGCCTgG GATGCACACA TCACTTGCCT 
GCCCCTGTAC GCCACCCCAC GGTTGATGGA TCATCACCCG AGAAGACGGA AGCGCAAAAC 
GCTTGCCAGG CGCACCTCCT GCCAGTAACA CTGCTGCCAT ACTCGAAGCC TGTCCTAAGC 
AAATGGTCTG CACCTCAGGG CAAATGTGCT GCATCGTATC GTACACTGCA AGC CCTGc AG 
TAACCGCCCC GCCAGGACTA TTAATGTACA GGCTGATATC CTTATCTGGA TTCTGAGACT 
CTAGAAAAAG TAAc TGCGCT ACAACTAAAT CCGCCACCGC GTCAGTGATC TCCCCGTCTA 
CGAAAATAAT ACGGTCCTTC AACAAGCGGG AAAAAATGTC ATAGc tCCGC TCTCCACCCC 
CCGACTGTTC AATCACGTAG GGAACCAGAT TATGCATACG TTCACX3CACG GGACTGCTCC 
TGAAGAAAGT CAGTCAAAGA CTGCTCCGGC CCACACTCAG TCACACATCG TCCGAGCAAT 
TTCTGGCACA GCTTCCGTTC TCGTATTCCT TCACACAGCG CACGCCGTTT TTCCTCCCCT 
GCATAATACT CGCGTACCCG CTCCTCTTTG GAACCTGTTT TGGACGCAAT GCGTACGTAC 
TCCGTCTCAA TTTC CTCAGC AGAAACAGAC ACCTGCTCCT GCTTAAGAAG GAGCTCAACA 
ATCACACGCT GCTTCAGGTG CTCTTCCACC TCCGGACGCC ACTGCTGAAA AAACTGCAGC 
TTATTCTGCG GGGTGCCCGA CAGGCTCACC CCAAACTGAC GCATCACCAA CGCCCAACGA 
GACTCCATCT CCCCCACAAC CAAAGATTCC GGCAGAGAAA AAGGATTCTC CCGCACCAAT 
ATACGCAACA GCTGCCGCCT CTTATACTCG TGCAGCGCTG CCTCCAAcGC TTCCGCGAGG 
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TTTTGCCGCa aCTCCGTGTC AGATCGTCAA GTGTGCGAAA AGCATCGCTC ACATCTTGCG 2700 

CAAGcTCATC ATCAAGACTC GGCAACTGAC GCTGCTTGAG CGCCTTAAGC GTTACCCTCA 27 60 

CCTGAGCGGC TTCGTCCTTC AGCATACCGG CCCTTTTAGC AAAGAGACAC CGCTGTCCTA 2820 

ATTTCATACC CAATATATCT TGCCCAAGCG CAAAGGGACC TTCCTCCACC CCAAGCGTAA 2880 

AGACAACGCC GGCGCGCTCA GTACCCGGAC GAACGGCACC TGAATCGTCA ACCTCGTGAT 2940 

AATCGACGGT GGCAATGTCC CCTACCTCTG CACACGAATC TGCACCCTTA TCAGTAACCA 3000 

GCGCATTGCG CTCCTGAATA CGCGTTAACT CTCGAGAGAC GTCCTCTTCT GTGACCGACA 3060 

CAGTGGGCAC GGACAGCGAA AAGCCCGATG TGTTGCGTAG TTcAACGGAA GGAAATACGT 3120 

CGTATATGAC AGCAAAaGAG AAaTCCTCGT CAGGATCGAA CACTGGCTTT TTCTTAAGCG 3180 

AAGGACGGGA GATAGGAAGA GGCTGACTGT CCTGCGACGC CTGGGCAAAC CCCTCCTCCA 3240 

GAGCTTTTTC CATGAGGGCC GCCGCTGCAT CTTGCCGAAT AGCACTTCCA TACTTCCGCT 3 300 

CAAGCACTGC AAGAGGAACT TTCCCCTTGC GGAAACCAGG AAGCCGCGCA CGCTCAAGAT 3360 

ATTCCTCAAC AAAACGCTGA TAATGcCGct GCGCATCCTC GCGCGCGACG ACCACCTCTA 3420 

GCTCAACCTG AGATTGTGCA AGCGCGGTGA ATTTTTTTTG AAGTTCCACA AGCCCAGATC 3480 

CTTAGGAAGA AATACCTACG TCCGCAACGC cTCGCACGGT CCAAGCAGGa TGCAGCAAaG 3540 

CGCTGaAAAA GCGGGAAACG GGGATCGAAC CCGCGaCTTC CACtTGGCAA GGTGGCGCTC 3600 

TACCACTGAG CTATTCCCGc ACAGGCGCCt GCGAGAGGAG GGACTTGAAC CCTCATGCCA 3 660 

GAGGCACTAG ATCC TAAGTC TAGCGTGTCT GCCGATTCCA CCACTCTCGC ACGGAAGAsA 3720 

TsCGGCAAGC AAAAACTCGC CCAACAGGAT GCAGACACCC AACCGCCCCT GAGCCATGcA 3780 

GGCTTCGAAC CTGCGACCCA CAGATTAAGA GTCTGTTGCT CTACCAACTG AGCTAATGGC 3840 

CCGTCCTCCG AGACCCTCCC CCCAGGATCA CATATCATGC AAAAAGGATC AAGATGAATC 3900 

GTATCGTCGC GTCCCACGCA CCTCCTCTTT TCGCTCAACA TTTCCTTCAA TCAGTCCAAA 3960 

CCTCTAGGAA GATATCCAAG TCGCCGAACA CAACAGGGGC GTAGTAGGGG ATTGACTGTG 4020 

CAGTCACTGG GTCCGTCGGG TTCACCCTAC AAGGAACACT CCGTTTGCCG TACTCGTTCC 4080 

GCATAGGCCC TGTCTTTAGG TATAAACTCC CTGCCCCATA CGCCGGCACC CCTTTTTTGC 4140 

AGGAACGGTT TCCTAAACGA GCTATCCGTG CTACCCTGGC AGCCGACCAG GGAGGGCGCG 4200 

TATGGATCAG CATACACGTA CACGCGATCT TGTTTCTGCA TTTTTTGGGC GCTTTCACTT 4260 

TGATGTCCAG GGACCTTCCG TCCGCACGGT TGTCGACGTG TTGCGCGCAG ATATGGTGCG 4320 

CGGCTTAGAG GAAGAGGCGC AGCTTCCTCC CCGTATGGGG AGTGCACTTG CGATGATTCC 4380 



Printed from Mimosa 02/03/22 07:29:24 Page: 732 



WO 98/59034 




PCT/ 




731 



CACTTGGGTG GCGCCCCCCC GTGTATCCCC CTGCAACCGA CGCGTGATAG TTATCGACGC 
TGGAGGAACC AACTTTCGCT CGTGCCTCGT ACGCTTCGGC GACAGTGGCA CACCTCACAT 
CGAGAATTTA GAAAAACGTC CCATGCCCGG TACCACCCGT GAGTACTCAA GGACAGAGTT 
TTTTGGAGAA ATTGCAGACA ACCTGGCACG TCTGAAAGGT GCAGCGGACT GCATTGGCTT 
TTGTTTCTCT TACCCTATTC GTATCAGACt GACGGTGACG GTGAGGTTAT TCAGTTTGCG 
AAGGAAATCA AAGCTGCTGA GGTCATCGGC ACGTGTGTCG GTGCTGGTTT GACAGAAGCG 
CTAAGTGCTC GGAACTGGCC TGAACTCCGT TCTCTCAAAA TGCTCAATGA CGCAACGAGT 
GCGCTGCTTG CAGgTTTTTT TGCGGCACCA GAGGGGTGTT CGTTCAGTTC ATACGTAGgT 
TTTATTCTTG GCACTGGAAT GAATTCTGCG TATCTGGAGC CAGACCCTAT TCCTAAAATT 
CCTGCGCATC ACACACCTCA GGTGGTAGTG TGCGAATCGG GAAAAAGCAA CAAAGTACCG 
CGCAGTGTCT TTGACGAATT ATTCACTCAA ACTACTGCCG AGCCGGATAT TGCACACCTA 
GAGAAGATGT CCTCGGGCAC CTACCTCGGT CCCCTTGCTT CCGTTGTCGT GCGGCTTGCG 
GCACAAGAAG GTCTTTTCTC ACACGCAGTA CACGCTGcAC TCAGTACGGT TTCCTTTACA 
CTCGTGGATA TGGATCGTTT TTTATTTGCT CCCTCTGTGT CCACCACCAC GTTGGGCGCG 
TTGCTCGCAC CGGGCACCGA CACAGACCGA GAGATTCTCT TTCTTTTGCT CGATGCGGTA 
kTTGCACGTG CAGCACGCAT CGCTGCGGGA GTAATCGCCG CCTCAGTATT AAAAAGCGGT 
GCTGGGTATG ATCCGCTTCG TCCCGTGTGC G TGCTCGC AG AAGGCACCAC GTTCCAACGC 
ACCTACCGCC TACGCACCCG GGTTACTTCC CACCTGCAAG C CTTTTTG AC TGAGGAGCGC 
GGTGTGTATT TCGATATCAT TTCACTTGAA AACGCCGTAff CGCTCGGCTC TGCACTCGGA 
GGACTCAGTT CGTAGGCATA TGCCTAAACG GACTGATGAT CCTGTGAGAG ATAGCGCCGT 
GCAGTGCTTC CTGTCATCTT CTCGTCGGCC GCGTGTGGCT GAGCGGCCGT GCTCGCCTTC 
TGGTGCGAAC GCGCTCTCCC TGTCTCTAGG GGAGTAACTT CCACGCCGAG TGTATCTTCT 
CAATCTTGTA CACGAGTAGT GCCTTCCCTC CATGGTTCAT GATTACATCC ACTCGGGTCG 
GGCTCTCGAA GACAATGGAG TCCACCCGCA CGTTTTGCCG CGACGGCACG AACACGTGTA 
TAAAATAATC GCGTAGtGCC GCAAACGAAC ACCCTTCTTT GGCAGGGCGT CCGAGCTCTG 
CTGCAATATG TCAGGCATTG AGTACGCGCG CCGATATGCG TCGGAAAGAT ACACAAGCCA 
CTTGTGATAG TCACGTTCGG CAGTGATACG ATTCAAATGC GCCACCACAT CTTGCAATTC 
TGCCTTTGTA CGCTCATAGT CTGAGCGCGT GATGCGCACG GTGCCGAAGT GGCGACAACG 
CCCGCACGCT CCTGCGGACT GTGAACATTC ACCTTTCCAT CGTTCTGCAC CCACTTCTGG 
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ACGCGGGGCT GTGTGCATGA AACACTGCCC CACAATACAC TTCCGATTAA GTAGCATACT 
TTTCCTCTTG CAAATGCGCT CCTCACTACA CGCCCACCAG CGTACACAGA GTAGGATCGT 
TGAAGAGCTT TGCGTACCCC TGTGCCGACA AGCC CAACTT ATCCTCCAGG TACGGATTTG 
CACCATGATC CATCAATAGC CGAATTAATA CATGATCTTT CCTACCCACT GCCAACACCA 
GCGCCGTTTG ACCATTTGAA CCTCGCACGT TTGGATCTGC TCCTGCATGC AAGAGCAGAC 
GCGCAACAGT TCGGTTCCCA ATTTGAGCTG CTTCCATCAG CGCAGAATAC GCGCGATCGT 
CAGATAACTG ATCTACTGGc gCACCGCGCG CAATAAGTTG CGCTGCCATC TCATCCTGAC 
CCTCCCGCAC TGCCAAAGAC AACACAGsGT ACCGCGTGCG TCTTTCAACG CAGCGCTAAA 
TCCTGCATCC AAAAAGAGAT TGACAATATC AATATTCCCA TCCATGACTG TCGCGATGAA 
ATTTTCTTCA AAACATGGAT AACCGCGCTC TAACAGCGCA GTGCGTGCGA CACGCTTCTT 
TTTCTGCCTT AC AAATCTc T CGTGCTCGAC ACGAAAGAAA TCCTcAAACG TCTCCTCCTC 
AAGTAAAAAG ACCAAGTCGC GAAATACATG GATATCCCTG ACCTCCGTTG TTGTAGCCAA 
GAGCAGCACG TGCATACCAC GCCCACAAGC AACTCCAGAA AAGAGnATAA AAGCCGGATC 
GCGCATGGGC TCATGCGGTA CAAAAAAAAC ACATGCGTTG CATCCTGTAC CAACGCCAAA 
GGGTAACTGG CACGGTGGAT GTGGTC 
(2) INFORMATION FOR SEQ ID NO: 118: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1323 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 118: 
ACTGTCTGCC ACGCTACACA CACTCACACA TCAGTATCTT AAATACGAAG AGTGTTCAAA 
ACAGCTTGCA CAAAAGACGC AAGAAAGCGC AAAGCTTATA ACTCTTTCAG ATGAACTGAA 
TGGGATAAAC CAAAAAAAAA TACAATTTGA CGCATGGGCA CTCATTTCTT TTCTGCACGA 
AATTACTGCC TACGC AAACA TACGTTTGCA AAAAATGAGT GAAGGACGTT ACCATCTGAG 
GGTAGCTGAC TCGCACGTCA ATGCACGAGG ATATCAAGGA CTTGCGCTGC TCGTTGCAGA 
TGCGTACACT GGGAGCGTGC GCCtTCGGCA ACACTTTCAG GAGGCGAAAC CTTTATGGCC 
TCTATCAGTC TTGCACTTGG TCTTGCAGAT TCTATCCAAA CCCGATCGGG AGGTATTGTG 
CTTGACTCGC TGTTCATAGA TGAAGGATTT GGAAGTTTGG ATGAGGCAAG TTTAGATAAG 
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GCAATTGGCA TCTTAGATGA AATCAGAGAG GGAAGTCGCA TGATAGGCAT CATTTCTCAT 540 

GTTCATGAAT TGCGCACGCG CATCCCTCAC AAAATTCTGA TAAAAAAAAC AAACGCAGGA 600 

TCACACGTAA TGCAGGGGGA TGCAGAATGA AAACGAGCGC GCTCTTTCTT GATTTTTACG 660 

AATTGACTAT GGCGCAGGGA TACTTTTTTC ACAAGCCGCA CGAGTGTGCG tGTTTGAAGT 720 

ATTCTTTCGT AAACACCCCT TCGCGGGAGG GTACTCCATT TTTGCAGGAC TCGATCCGCT 780 

CCTGACGGCA ATAGAGCAGT TCCGCTTCAG TGGAGAAGAT ATCG ATT ATT TGCGCACCTT 840 

GC AC TTATTT CATGATGACT TTTTGTCTTA CCTTGCTTCC TTCCGCTTTT CAGGAGATAT 900 

ACACGCGCTA GAAGAAGGTT CAGTAATATT TCCTCACGAA CCGATCATCC GCGTGCACGC 960 

GCGCTTGGTT GAAGCACTTC TGCTTGAAGG ATTGATACTC AACACCATTA ATTTCCAAAG 1020. 

CCTCATCGCA ACAAAGACTG CACGGATGTG GCGCGCGTCA GGTGAAGGTG TTCTTATGGA 1080 

GTTTGGCCTC AGAAGAGCAC AGGGCTATGA CGGCGCGTTG AGCgCCACaC GCGCTGcTGC 1140 

AATAGGTGGC GCAACAGGGA CAAGCAATAC ACTTGCTGCA AAGCTcTACG GTATTCGGCC 1200 

AATGGGAACT ATGG cGCACG CgTGGGTGAT GTCTTTtGAC AGTGAAGAAG AGGcCTTCGA 1260 

ACGCTATGCT GCACTCTATG GAAGCGCGTC CGTATTCCTC ATCGATACGT ACCATACCCT 1320 

GGG 1323 
(2) INFORMATION FOR SEQ ID NO: 119: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 3076 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 119: 

TACTnCTACT TCCATCCCCT CCAGAACTCC TCCCgGATAT TCCtTCATCA GTACATGGAA 60 

TGCTCCCCTA CCGCCTTGCA CCAAAAATGC AAGACCCGTA GCTTCGGTAT ACCGCTTAGC 120 

CCCGATACAT TATCTGCGCA TGCGTACTCG ACCAGTGAGC TATTACGCAC TCTTTCAaGG 180 

AATGGCTGCT TCTAAGCCaA CCTCCTGGCT GTCCAGGTAC CCACACTTCA TTTCACACTC 240 

AAGCGGTATT TCGGGACCTT AGCTGACGGT CTGGGCTGTT TCCCTCTCGA CTACGAACCT 300 

TGTCGCACGC AGTCTCACTC CCACACGTTG ACTGCCGGCA TTCAGAGTTT GATTGGGTTT 360 

GGTAGGCGAT GAAACCCCCT AGCCCATCCA GTGCTTTACC TCCGACAGTT TTGTATAAGG 420 

CTGTCCCTAA AGGCATTTCG GGGAGAACCA GCTATCTCCA GGTTTGTTTA GCCTTTCACT 480 
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CCTAGTCACA AGTCATCCAT ACCTTTTTTA ACAGATTATA GTTCGGTCCT CCACAAGGCT 540 

TCACCCCTGT TTCAACCTGC TCATAACTAG ATCACCCTGG CTTCGGGTCT ACGACGTACA 600 

ACTCACCACG CCCTTTTAAG ACTCGGTTTC CCTCCGGCTC CAGGACTCCT ATCCCTTAAC 660 

CTTGCTGCAC ACCGTAACTC GCAGGCTCAT TCTACAAAAG GCACGCTACC ACCCTCACAG 720 

GCTGTAACAT CTTGTTGGTT TACGGTTTCA GGTTCTATTT CACTCCCCTC ACCGGGGTTC 780 

TTTTCATCTT TCCCTCACGG TACTTGTCCA CTATCGGTAG TTGTCGAGTA TTTAGCCTTA 840 

GATCGTGGTC GACC C AGATT CCGACAGGAT TCCTCGTGTC CCGCCGTACT CAGGTACCGC 900 

ACCAGCAGGT CCGCCCCATT CCGCATACGG GGATTTCACC CTCTCTGTCA GGCTTTCCCA 960 

AAACCTTTCT GCTATAGGCC GGATTATTTC ACCCACCGAA CGCAAGCCCG CGCGGCCCTA 1020 

CAACCCCTGT TAGACACAGG TTTAGGCTCC TCCAATTTCG CTCGCCACTA CTTTCGGAAT 1080 

CTCTCTTGAT TTCTTTTCCC AAGTTACTTA GATGGTTCAG TTCACCCAGT TTCGCCTTAC 1140 

CCTCCCTATT CATTCAGGAA GGCAATGACA AGGCTTTACC TGTCGGGTTA CCCCATTCGG 1200 

TCATCCCCGG ATCACAGGAC ATGTGCTCCT CCCCGAGGCT TTTCGCAGCT TATCACGACC 1260 

TTCATCGCCT GACAACTCCA AGACATCCAC CGTAAACCAC TATTCGCTTG ACCATATTAT 1320 

CCATCCCTTC TCAACTTCAC ACCCCACCCT AATACTCTCA AAAATCACCT ACCACCTACT 1380 

CCTTACCCCA TAAACAAAAC AAaGGGACAT AAaGAATAAT AGTGGGCTTT CCCTGGAGAT 1440 

AgGGGACTCG AACCCCTGAC tACGACCTGC AAAGCCGTCG CTCTAGCCAG TTGAGCTATA 1500 

CCCCCTTTTC AAAAGGGAAG GGGAGAGACT GCCGTGCAGG AGCAGAAAAA CCtTaAGtGG 1560 

CTTCCGCCAC ACGnCGAACA CGGCACCATG CCATGCCCAT ACCCTTTcTC TTAGAAAGGA 1620 

GGTGrayCCAG CCGCACCTTC CGGTACGGCT AC CTTGTT AC GACTTCACCC TCCTTACCAA 1680 

ACATACtTCG GCACCGCCCT CCtTGCGGGT TAGGCTAGTG ACTTCGGGTA TCTCCAACTC 1740 

GGATGGTGTG ACGGGCGGTG TGTACAAGGC CCGGGAACAC ATTCACCGCA CCATGCTGAT 1800 

GTGCGATTAC TAGCGATTCC AACTTCATGA AGTCGAGTTT CAGACTTCAA TCCGGACTAC 1860 

GATTGCCTTT TTGCGGTTTG CTCCACTTCA CAACCTCGCA TCGCTCTGTA GCAACCATTG 1920 

TAGCACGTGT GTAGCCCCGG ACATAAGGGC CATGATGACT TGACGTCATC CCCACCTTCC 1980 

TCCGGTTTGT CACCGGCAGT TCCGCCAGAG TCCCCAACAC CACTTGCTGG CAACTGGCAG 2040 

TAGGGGTTGC GCTCGTTGCG GGACTTAACC CAACACcTCA CGGCACGAGC TGACGACAGC 2100 

CATGCAGCAC CTGTCAAGAG GCGTATcGct ACGCCACCGC ATTTCTACGG CGCTCCTCTT 2160 

GATGTCAAAC CCGGGTAAGG TTCCTCGCGT ATCATCGAAT TAAACCACAT GCTCCACCGC 2220 
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TTGTGCGGGC CCCCGTCAAT TCCTTTGAGT TTCACTCTTG CGAGCATACT CCCCAGGCGG 
TACACTTAAT GCGTTCGCGT CGGCGCCGAG ACTCATGCCC CAACACCTAG TGTACATCGT 
TTACTGTGTG GACTACCAGG GTATCTAATC CTGTTCGCTC CCCACACtTC GCACCTCAGC 
GTCAATCATC GGCCAGAAAC CCGCtTCGCC AC CGGTGTTC TTCCAAATAT CTACAGATTC 
CACCCCTACA CTTGGAATTC CGGTTTCCCC TCCGTGATTC TAGACCAGCA GTACCCAGTG 
CAGTTCCCAA GTTGAGCTCG GGGATTTCAC ACCAGGCTTA CCAGTCCGCC TGCATGCCCT 
TTACGCCCAA TAATTCCGAA CAACGCTCGC CCCTTACGTG TTACCGCGGC TGCTGGCACG 
TAATTAGCCG GGGCTTATTC GCACGACTAC CGTCATCAAA CGGGCATTCC CTCCCGTCCT 
CATTCTTCGT CGGCAAAAGA ACTTTACAAT CTTTCGACCT TCtCATCCAC GCGGTGTCGC 
TCCGTTCAGC TTTCGCCCAT TGCGGAATAT TCTTAGCTGC TGCCTCCCGT AGGAGTCTGG 
GCCGTATCTC AGTCCCAGTG TGTCCGGTCA CCCTCTCAGG TCGGATACCC ATCGACGCCT 
TGGTAGGCCA TTACCCCACC AACAAGCTAA TGGGTCGCAG GCTCATnTCT GAGCGAGGCC 
GCAGCCCCTT TCCTCTCAAA GACTACGTCC AAAAGAGCGT ATTCGGTATT ACCCCCTATT 
TCTAGAGGCT ATCCCCATCT CAAAGGCAGA TTACCCACGC GTTACTCACC AGTCCGCCAC 
TCTAGAGAAA ACGAAA 

(2) INFORMATION FOR SEQ ID NO: 120: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1091 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 120: 
CnGATGTGCG TAAACGCCTG GAGACGCTCA AGAACACGCA ACGTCAAAAG GGCAGAACGC 
CGCGCCGCAC GgTCGAAAAC TCTTTATAAC TCAT AGAGAC ATCCTGCTCA GCACCCGCAC 
CGTCGGATAT ACAGCGTATA ATAACAAAGG GGACGCCATT AACTGACGCT ACATGCGCAA 
AGGCTGCCCC CTCCATTTCC ACACCATGCG CACCAAATTC GCGTATAATG CGTGCACGTG 
TTTGCGCATC TGACACGAAA AGATCCCCTG AGGCGACGCG CCCTTCGACT AAACGAGAAA 
CGCGAGAAGG CGGATCCCCC GAGCCAGAGA GCGCACAAGC ACCCTCGGTC CACTCCGGAT 
CCCGTGTGCA GAGATCAAAG GCTTCCCGCA CCAAATACCG CAATGCCGTG TTCGCAGTCC 
ACTCTACAGA ATCCATGCGC GGAATACGCC CTTTCTGGTA ACCAAAAGCG GTAAcGTCTA 
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CATCATGCTG CACTGCATCG ACAGAAACTA GCACATCAAA AACACACAAG CGCTCATCGA 
GAGCACCTGC AATTCCTGTA TTGATAAGCA CACGCGCACC AAACTCCGAA ATGAGTAGTT 
GAGTGCAAAG CGCTGCATTC ACTTTCCCAA CACCGCCGCA CACATACACC ACCTGAAGCG 
CACCCACCGA CACAACATAG AACGTGAGCC CTGCCCGCTC TGTACCTACT CCCCCGAGAC 
ACTCACGTAC GCGCGCAACC TCCTCTCCCA GTGCAGCAAA AACGCCGACC GTCACGCACC 
CTCCCCGTGA AAAACACGAA AACGCGCACT CGCAACCCAG GCACGGAAAA AAGCTGTCCC 
TTGAAGGTCA GGAAAAAGCC CCGACCACAA GGCACACCGA TAAATGAACG GAATATAGCA 
GGGAGAGGAC TCGAACCTCC GGCCTCCGGG TTATGAGCCC GACGAGCTGC CAACTGCTCC 
ACCCTGCGGT GACGCACAGA GCGTACCACG ACTAGAGCCC GAAGTCAAGC CACAAAGCAG 
GACGCTCCGC CCCAGCTTGA AGCGGAGCCT TACAATCATA CATACGACCA GAGGATACGA 
CACGCAGTTT A 

(2) INFORMATION FOR SEQ ID NO: 121: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19186 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : doubl e 

(D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 121: 
ACGACCTCGC CTGCAAAATC GCTATCCTTT CCCCATCTTT ATAGGTGAAT AGACCCGGCC 
AGGGAGTAAA GGCCCGAATC TTGCGCTCCA ACACAAcTGC AGGATTAcTC CAGTCCGCCA 
aTCCCaTCTC CyTACAGAGC TTACCACAAA aTGTTGCCTg CGAgTGaTCC TGtGCTGCAG 
GGGCGAGCGT GTGCCGCTCA ACACCGACTA AAACATCATC CACAAGATCG GCAGCCACCA 
AAGACAAACG AGAAAGAAGC GCGCCAGTAG TCTCTGTACC GTCGAGCTGA ACACGGGAtG 
CGCAAGAATG TCCCCTGCAT CCATCTCCTC ACCAATGTAC TGGAGCGTCA CACCCGTCTC 
GCAGTCCCCC GCTAAAATCG CCGCAGGGAC CGGTGTACTC CCTCGCCAAC GCGGCAACAA 
CGAAGGATGA ACGTTAATAG CACCGCGCGG GAAAAGCGCA AGGAACCGAG GGCCAAAAAT 
CTTACCGTAG GCAAAACACA CCAGCACGTC CGGACGCAAA GCCTCCACCG CGTCATAAAA 
GGCGCGATCC AAACGCCCCG GAACGAAAAG AGACGCAGAC TCAGGAAGAA CCCCAGACGC 
TTTTAAGCGA AAAAATTCCC TTGCAACCGc AGAATGGaCA AGCTTcCCCG aGCGTcCGaC 
AGCAGCAGGA GGATTCGTcA AAACCCCCAC AACCCGaTGA GCGcACGCCA CCCGGCGCAA 
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AGAAGGCACG G c AC ACTCTG GAGTTCCCGC GAAGAAGACC CTcACCATGG GCTAACGAAT 
CGTCCCGCGc AGCGC gnC AT ACCGACGAAG GGCATCATCC CTCCGCTTTT CGTCAATGCG 
ATCCAAAAAA AGAATACCGT CAAGATGATC GTACTCGTGC TGGATCACTC TTGCCAGAAT 
ACCATCTGCA TCAACGGCAC AACGTTTACC ATTCTCGTCG AGGTACTGCA CACTCACCCG 
ACGCGGACGC AACACCCTTT CATAAATGTG AGGAATGCTC AAACAGCCTT CCTCGTAGGA 
AGATTGCTCT TCAGACGCGG CAGTGATCTG GGGATTGATA AAAG CGCGG A CGTGGTGCTC 
AACATCTACT ACAAACACGC GGACGGTACG CCCTACtGCG GCGCCGCAAG CCCCACACCG 
CCTGCCCCAC GCATCACACG AAACATACCC GAGATGAACG CGCGCAGCTG CTCGTCCACC 
TCCGAAACCG GCTCGGAAAC CGTTGTCAGG CACGGCTCAC CTAAAAACTT AAGCTCCACC 
GCCTCTTCCC GCTCTTACCC AAGAGAAACT CATACCCGCC TAGTACAGCC TTGAGTCGCT 
GACCTTGCGC CCCTTCTTCA TCAGtTGCGC CGCAACGCCT TCTTCTTGCG ATTCAGAAGC 
GTGGAAGGCT TCTCATAAAA TTCTTTTTTC TTCCACTCGC GAATAATACC TTCCTTCTCC 
ACCTGGCGCT TGAAGCGCTT AATTGCCTTC TCTAAATTCT CAGAATCATC CACCGTTATG 
TGAGCC AC CG ACCCTCCCCT GAAAAAAACA CAGCCTACCA GGGGTAGnTA GCGCAAAAGA 
CGTCCCCTGT CAATCGTTTC TTCCGAAAGC ACTCCTGCCT CCCCTGCACG CACACCTtAC 
GTGTCTTGCT TATGCAACCG CTGACCTTCC TAACGTGCGT CTACTTGTGG TACAGTGCAG 
CGCTGTGCGT AGAAGTCAAA ATTGTCCGCA CCGACCTCCT TTCGAGTGCA TCCGTGGGAC 
GCTAGCCTAC CCTGCACAGG CGTCCACCCC TCCTGGGAAG AGGCCCGACC TGC CGTGAAC 
ATTACC CGAA GCCGCGTTGC GGTC TTTATC TCCTCCCTGA CCGCTGTGCT TCTGCTGCTC 
ACCGTTCAGT GCGCGCGGTA TATGCTCATG CGTGGCAACG AGACAAAGGA ACTGAACACA 
CTCACTGAAC GCGGCGCGAT CTTGGACCGA AATGGCCGGT TTCTTGCCGT TGGAACCACC 
GTCTACAACC TCAGTGTTAA CAAAAATCTT GTCTCAGACC CACGCACTGC AGCCCACGTG 
TTAGCACAGG TCCTTGACCT TTCAGAACAA GATATTGAAG AAAAATTCCG CACCGCGCGC 
GCTCACTTCT TTTACCTCAA GAAAAAAATG AGTGAAAcGG AAAAGAACCT TGTCGCTCAC 
GCTCTTAAGG AGCACTCCCT GAAAgGATTT CGCCTAGAGG CAGTGCGCAA CCGCATCTAT 
CCAGAAAGTA GCCTAGCGTC CACGGTCATC GGATACGTAG GTGATGACGG AAGGGGACTG 
AGCGGTATCG AGTACAC t TG CAGGATGTTC TTTCTCCTGC CCCGTACCAC ACCGGGTATA 
CGGGCAAGGG GCATACTGTC ACCCTCTCGA TCGACCGAAC CATCCAGTAC ATGATGGAAA 
AAATCGCAGA TACTACGCTC CGGCGTACCC AGGCAGAAGG ACTCATGTTC CTTGCGGTGG 
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AGGCAAAGAC AGGTCAGATT CTATCCTACG TCAGCAAGCC GTCTGCTAAC CTTTCACACT 
TTTCCCAAAG TACCCCTGCC GAACGCTTCG ATCGCCCCGC CCTTTTCATC TATGAGCCTG 
GCTCTGTGTT CAAAATTTTT TCCATCGCTG CACTGTTAGA ACTCGGGGTA ACTTACACCC 
ACGACACGCT CCACTGCGAC GGTTCCTTCT CCTTTACCTC CCCC TTTTT A AAACCAGGTC 
AAAAAGGCCA TCTCATCCGC TGcCTGCGCC CACACGGcAC CATCAGCGcT GaAGATATCA 
TCCGGcTTTC GTGTAATGaC GGcATGGcAC AAATTGcTGa ACGTGCCGAC AACCACAGCT 
TTGAGCAACT ATTGCGCGCT TTTGGATTTG GCGCGAAAAC AGAAATTGAG TTGCCGGGGG 
AAACCGTCGG TCTCTTCTCT CCCTCAGAAC GCTGGTCCCA CCGCAGCAAG CACACCATCG 
CAATCGGCCA GGAAATTGGC GTCTCTGCCT TGCAGtTGTG GCTGCCGcTA CCGCGCTCGC 
CAACGAGGGC GTACCGCTCG GCCTCTCCCT CCTCCATGAG GTCACTACCG CCGAAGtACC 
GTGGTGTACC GGCACAAAAA GAAACCCAAA ACACGCGTTA TCTCCGCAGT AAATGCGCAA 
AAGGTGTTGC GATACATGCG CACCGCCGCA GAACTTGGCA CCGGGAAAAA GGCGCTCGTA 
GACGGGGTGC CGATCGCAGT CAAAACAGGC ACTGCGCAAA TGGCGCACAG AAATGGTCGT 
GGGTACAGCG ACACCGACTA CCTTGCAAGT TGCATCGGCC TTTTCCCCGC GCACGATCCA 
GAAATTATCT TATACATTGC CATCATCCnt CCTATCGGAC AAGCC TATGG AGAGCTCATT 
GCAGCGCCTG TCATCTCTCA AGCGGCAAAC GAGATTATCG ACTACCGCGG TATGGTCCGT 
GCCAACGCCC CGTTAATCCA ACATAGCGGT CTCATCCATA CGTCAGAACG GACACCTCCA 
CGGTTAGGAA CCCATATGCC AGACCTCACC GGTCAACCTA AACGTTTACT CCTGGATATT 
GCAAAACGCA CCGACGTGCA CCTTGTCcTT ACAGGAGAAG GTTATGTGTA CGAGCAGCAT 
CCGCCTGCGG GCACACCTCT GACAAAAGGA ATGACCATTG AACTCAAACT CAAATAAAAA 
GCGAGATCCC GCGCGTTTCC CGGCCGGTGT . TGCGCAAGGC TGCAGTACCA CACGCGCAGG 
GGATCTGAAA CACAGAAGAA AGCATCCCTT TGAGAAGTTT TTTGCGCAAA ACAGCGCTCT 
TTTTGCCCAG CGGTTTCCAG ATCTTGCACG TGCGCTGACA CTTCCAAACG AGCAGCTCCT 
GCAACGCATC CCTCCTGATT ACCTCCTTGC AGCGGCCCAT GACGGAGACG CAACGCTTGC 
AGTACGGGGC ACCTATCTCC ACTCAAAATA TCGGCCGCGG CAAGAGGCTG CACGTCTTAT 
CAGCCAGGAT TTTTTTACGC ACGCGATTGC AAAAGGCGGC TATGTAGGTG CAGGTTTAGG 
TCTTGGCTAT GTAGCAGAAC TGTACGCGCA nAGCCACCCT ACGCACACGG TAGTGCTTAT 
CGAGCCAGAT ATATTCGTGT TCCTGCTTTT TTTAGCCAtA GACyTyTCAC TCCCCTCcTC 
CGACACGAAC GTCTAAAAaT ACTGCCTGCa CAGACGGTAC CGGATGTGTT GCAGTTCyTG 
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CGCGCCACGG GGGATGTGTC TCTCCCCCTG TTCCATTTCT TACCAGCCCA GGAGCTAAAC 
ACCGCGTGGT TTCACGATTT TACCCAGGCG CACCGGCACG CAACTGCACA GGCAGAAACG 
AATAAGCAAA CGCTGCAACG GTTCGGTCCG CTTTGGATAC GCAACACTAT AAAAAATGCC 
GCACAGCTGT GTGTGCGCAC GCCCGTAAAT GCGtGCGCAA tGCGCGGCAG GGAGTACTGC 
ACTCATCATT GCAGGTGGCC CTGGGGTGGA TGCGAGTATA TCGTTCCTTC CGTACCTAAA 
AAAAAAACAC TGATTATTGC GGTTGAaTAc CGCGTTnTGg CGTGTGCaGC GCGCGCGTCA 
CCCCGGACGT TGTGCTTCTG TTTGAtCCAC AATACTGGAA CTATTTGCAC GTAGCACGTG 
CGGTGGCGCC CCATGCACTG CTTATCACTG ACATTTCTGT GTTCCCTGCa GTGTTTCGAC 
TC CCGTGGG A GTACATTGTC CTTGCaGGTT CGGCATGCCC TTTCGCTACA TCCCTTGCGC 
ACCATGATGC CTCCTGCTCT CCTTCCCACG CTCTTCCTCT CCCTTTTGTC GCGCCCGATC 
TTCTCGCCTC AGGTGGGTCG GTTGCTACAA GTGCGTGGGA ATGTGCCCGC TACTTGGGAG 
CGACAACCAT AGTGTACATA GGGTTGGATC TGGCATTTCC TGGAGCGCGC ACGCACTTTC 
GCGGAGCGCT GTTTGAGGAG CGGGCACACC TGCAATCGGG TCGAGTGGCT CCTGCCGAAA 
CTACCTCCTT CTGTGCACTG CACTCCCTGC CCCTATATCC TGTCCCGGCT GCATCTGACC 
CCCATCCAGG AAAGAACTCC CCTGCTTnCC CTACCGGGGA GAACAAAGAG CAAACGGTGC 
TAACCGaCGC GCGTTTTTCA CTGTACGCCG TGTGGCTCGA AGCACACCTT GCGCGTTATA 
CGCACATTAA AACGTATGCG CTTGAGCCTG CAGGAAGGAG AGTTGCAGGC ATCACGCCCT 
TGCGCTTTTC CCAGCTTGTC ACGTTACTCA ACCGGTCTGC AGCTGTGCCG CACTGCCGGA 
CCATGTATAG CCGCCGACGC AGGCTATACT CCCGGTATAG AAACAGCAGC GTGCAGGTCA 
ATTGCACCGA TGCGGTGCGC TGGAGGCGCA CGATGAAGAG TACACCTGTG CACTGTCTCG 
GTCATTTCAC GAAGAAAAAG AAATCCGCAG AAGGTGCACG GAACCTATGG CGTGCTCTCA 
GACGTGCGGA TACGCGGTGC GCTTCTCCCC ATAACCTCGA GCACGCACTT GAGAGAACTC 
GTTCCTTCCT CAACGCGATG CCACTGACAC CCACAACGTA TGAAAACAAA ACACATGCAT 
TGTACACAGC GCTGTGCACG CTCCTCCCGA CCGAGCCTAC CTACCGCGCA CGGGCACACG 
CACACCTCTT TGAACTCCTC ACCCGAACAC TCAAGTTTTG CGCCGCCTAT ACCGAGGAGG 
AGGGGGAGTG GCGCGAGGCC tGACATCCGG GGCCACTCAA AGACGTGTGT GGGGCgCaGG 
GGCaGGGCCC GTGCGTTCCC CCTTTCAGGG AAAGAGCCAA GCGCGTGCGT CCTGCATCAG 
TTTTAGAAAC ACCAAGACGT CTACGTTTTG CTTCTTGCGA TTCTCGTAGT CCGCCCAGCG 
GCGGGGCGTG TAGTGTGCAT GGACGTCTGC CCTCACCTGT GCTGCATGCG CACAGCCGCG 
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CGCAATCAGC TGGGCGTCGT GCGCGGGGGC GCGCCTAAGT TTTTCAGGAA GCACTTCTGA 
CAGAAGGTAT GCACTGAGTA CGCGAATCAA CGGtGACGCT TGGTGTGCAC CAACGTGAGA 
AGATGCGCGT GTGCCTGTTC AAAGTCGCCG CACAGGTGAC ACGCGAATGC GAGGTAAAAG 
CGAGTCCACG GCTGGACGGG CGCCTGTGTG GCGGCAACGC GCGTCATAAA GGTACGCAGC 
TCCGAGTACT TTCCCGCAAG GAGCTTGCCA ACGGCAAAGG TGAGCGCGCA CTTCATGATA 
TACCGAGGGC TGCGCGTGCA CAAAATGTTC TAGTCGCTCA AGTGCTTCAA AGTCACTTAA 
AAGGATCAGT GACTCTGAAA GGAGAGACAC CCTCCGCAAG GCAACCCTAT TGCGGGTGAA 
CACTTCCTCC TCCAAAAGAG CTGCCAATGC CG AC CAATCT TCCGCATCCA GGTATCGAAG 
CAGTCTCCTG TTTTTAACAT AGAGTGTATT TATCAGCAAA AAGAGTAGGA GAAAAATGAG 
AAAAAAAATC CCATACGCGC GCACAGATTG CCAGCCTGTA CCCCGGACAG GAGTGAGAAA 
CAGAACATAT CCGGTACCCA TACACAGCAG CAGGAGTGCA CCGTTAAAAa GCACTACTAA 
TGCTTTGATC TTCATGGGAA aTCTCCTATA CTGCCTcCCC GCCTGTGCGG GGCGCCATTC 
TACTATGGGG GCGAGCGAAA CGTGAATACG TATGCAATTG TAGATATACC CGAGGAAACC 
CACTTCAGCG ATGACGTGTA TCTTGATGAC ACTTTTTTAG TGCTGACCCC CGCAGgTCTC 
TTTGACGAAG CGCTAAAGAG TCGAC TGCAG GAGTGGGACT TCACCACGGT GCACACCGGc 
GGACAACTGC TTTCTTCTGA GACTACACTG GAGTGTCCTC TGCCTGCATC CGGCGATCCA 
AACGCGAAGG GTGCTCCCGG ATCCGAGGCC TTGCGTGCAC AACGAAGCCG TTTTCACGCA 
CTGAAGAAAC AGTACAACGA GTTTCAAATG TTCGTTGAGC ACGTTTTTGA TCAGTATCGT 
GCAAAGCAGa GCCTGAACAC GCGCGCGGTC ATTGACCGTG CGAAGATGCT TTGCGAATTG 
GTAAAAAAGC ATCGGcAAAC GCTACTGCGC GTCCTTCCCA CCATTCCCTA CCGGGAGCAC 
TACGCGCTTG AGACACATGC ACTCCGTTCG ACGTTGTATG CTGTTGTCAT TGGGTTACAG 
CTAAAAATGC AACCGTTCAA GATCATtGAG C TCGCAAC AT CTTGCCTGCT TCACGAAATT 
GGCATGGCGC GCGTTATCCC GAAGCGTACA CCACGGAGGG GGAGTTAGAT CCAAAAACGC 
AGAAGGCGAT CTTTGCGCAC CCTATTATCT CCTACCATAT CCTGCGTGAC CACTCGCTCC 
CTCTGCCGGT GTGTGTTGGG GCACTTGAGC ATCGCGAGCG CGAAAATGGG CTCGGCTACC 
CGCGCAGGCT CGTGGGAGAA AAGATTTCCC TCTATGGCAA gAtAAtCGCA GTCgCtTGTT 
CtTACgAGGC AGCAACCGCT CCGCGCTCAT ACAAGGAAAT GAAAAACGCC GCTGAGGGCA 
TCGTCGACCT TGTGCGTAAC GCGAACACCC AGTACGATGC AGTCATTTCG CGCGCGCTTC 
TCTTTGCCCT GTCGTTTTAT CCTATCGGGA CCCATGTGCA CTTATCAAAC GGGAAAATCG 
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CGCAGtGGTG GACGTTAACC 
AaTTCACCGC AACGGCAAGC 
GCGGGCCCTG AGCGTAAAGG 
CGCAGGGcAG ATGCCCCTTC 
TGTTCGCAGC GAATAACTTC 
CGCGtTGCGA TGTACTCAAG 
TAGgTcCAGC TAGAACGCGC 
ACTCGCGCGC TAACTGCGGC 
TTGAAACTTT TCGCTCATCC 
AGAAATATCG TTCCCTGCAC 
TCCTGAAAGC AG C AC ATC T A 
GTCTTCAGGA ATGTGAAAAG 
TTTGTGCACC TTGTCTGCAC 
TACGCAATAA TCCGGCGCTC 
ATTGTCTTTT CGACCGCTTC 
ACCGCGAGCA ACGCTGCCTC 
GTGATGCGCG CCACTGCCTT 
ATACGCAGAA TTGCCTCTCT 
AAACGACCGG GGCGTAGGAG 
AAAATGAGAC CGGTGGTGTT 
TGTTCCCGCT CATCGTTGGA 
TCATCGATAA AAATAATCCC 
CGCGAGGcGC CAATCCCCAC 
GGCACTGACG CCTCACCTGC 
CCGACCAACA GCACCCCGCG 
TTGAGAAAAT CAACTACTTC 
GCAAAGCGTG TGGTGACCTT 
AAAATACTGC TCCCCAGCCC 
ACAAAAAAGA GGATGAGCAG 
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CCGACGACCC ACGCTTCCCC ATCGTGCAGG TACACGGAGA 7740 

CCATCATTCA CAGCACGAGC GCAGACGAAA TTTTTATCAC 7 800 

AACAGCGTCT TATCCTTCAG GAAGGTGGCG ACTGACCCTG 7860 

CTGCTGCTGG CTTAGACGCT TTTTTAGCCC CTCTAGGTCG 7920 

CTCAAACTCG TCGCGCTCGA TGGTTTCTCG CTCCAAAAGg 7980 

GAGCTCTTTT TTCTCCGTCA AGAGTGCTAC CACCGCGCGG 8040 

GCCACTTCCT CATCAACGTA CTGCTGTGTG CACTCCGAAT 8100 

TCCGCGAGAT ACCCGGTTCC GCGGCGAGTA AGTGCAAGgT 8160 

CATAATCTGT AATCATCTTG CGGACAATGT CTGTTGCGCG 8220 

CAGTTGAAAC TTCCCCAAAG GCTACAAATT cCGcTGCGnk 8280 

CTTCTGCCAA CAACTGCTGC TCCGTAACAA TATGCCGATC 8340 

TATATCCAAG CGCAGAGGTG CCCCGGGGAA TAATTGTAAT 8400 

CCTTCGTGAA GGTACCTGCA AGGcATGTCC TGTCTCGTGA 8460 

TTCTTCTCGA ATTACCCGAC TTTTTTTCTG CAATCCTATC 8520 

GTCCAAATCC GTTTCAATCA CCTGCGCACG CCCAGACCGT 8580 

GTTCACCACG TTTGCCAAAT CAGCACCTGA ATACCCACCG 8640 

CAAATCCACT TCTGGCGCTA ACTTCACGTT CTGCGCATGA 8700 

TCCCTTAAGA TCGGGCCGAT CTACGCAAAC CTGTCGGTCA 87 60 

CGCAGGATCT AAC AC ATCGG GGCGATTGGT AGCAGCAAGC 8820 

ATCAAACCCA TCCATTTCTA CCAGAAGCTG GTTAAGCGTT 8880 

ATGGATAGCG TTCAGGCGGC TTTTTCCAAT TGCGTCAAGC 8940 

TGGCGCCTTC TCCCGCGCTT GTTTGAATAA ATCGCGCACA 9000 

AAACATTTCG ATGAAGTCTG AGCCACTGAT GCGAAAAAAG 9060 

CACTGCGCGT GCAAGCAACG TCTTACCCGT CCCTGGGGGA 9120 

GGGAATTTTC CCCCCGATTT CAGTATACTT TTTAGGGAAC 9180 

CATCAGCTCT TCTTTTGCCT CATCCACCCC TGCAACGTCT 9240 

TCCTTCTTCC ACCGCCgCAG AGCGCGCGTG TCCGGCTGAG 9300 

GCTTACATTT GAGGCCATCC GCTTAAAGAA AAAGCGCCAG 9360 

CGGAAAGAGA TATTGAAATG TCTCTATGAG GTAATTGCgC 9420 
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TCGCGCGGCT TAATACTGTA GACCACCTGC CGCTCATCGA GCATGCTCAA AAAGGAATCG 
GAGAGGACAC CGATGGCATG ATAGGTAGGA GCCTCCCGCT CAGAGAGCAA CGAGAAACCC 
CGCGCAGAAG GCGCAGGGCG CGCGGAAGTG TACCCGACAA AGTAAGGGGA ACCGACAACA 
ACCTTTACGA TTTCCCCACT TGCAATGCGA TCTTTAAATT CCGAGAACGG GATGATGCGC 
AAAGCACGGG AAAACAAAAA GTGGTTTGCA AGAGCAAGCA GCGCACAAAG AGCAAGGAGC 
ACGAGCGAGA GCACCTTACG CGAATTTCTG CGCGGAGGTC TTTCGCGTGA CGACGAAGGG 
CCTTTTTGAG GGCGGGGGGA GAACTTAAAA AACCCGAATG GATCTGAAGA ATCATCTGAC 
TGTTTGTAGC GCGTATTCAT CTC AG TTAAG GCTCCCTTTT GCAGTGCGCG CGGGCACGCC 
GGCCCATGAT ACACGAAAAG AAGGTGAAAC GTCGACAGAC TGTGTGAGAG CCGTACGGcg 
CAgCTTAGCG ACGTCCGCTG GGGCGGTATG CACGGTGCTG CAAAAAAACA CACGCGCGCG 
GCTATTGCTT ACCCTCATGC GCGCGTTCAC CGAAACTTCA CCTAATGACA CCTACCTATC 
ACCCGTACCC GGGCGACAAT CGCCCCTTTC T AT AC GCCGC GCTTCAGCGA CGTGAGCACC 
TGTCCCTTAT CCGGTACCGC GCAGAGCACG CGCAGGACCT GGCACCCTTG AGAGCGTTCT 
TGGCGCGCAT CGAGGCGCAC GCAACCGTCA TTGGTGCACG CACGCGTGGG GACACCCTTT 
TTATCCTTGC CGCATCGATG CCGTCTGATG CACTGCGCGA CGAAAAACAC GCATATGTGC 
GCACAATCTC CTGGGAACAG GCTCCTCAGA TACTTGAGAC GTTGGAGCGT CCGCCGCTCC 
CTCCTTATTG CCCGCCCGTT CCTACTTCCT GTTCTTCTTC GCGTCTTATT CCCGACGTGC 
CGCACAACAC CAGGTCACAC GCGCAGGAGA GTTCCTACAC CTCTCGGCAT GCGCTTTTGA 
CGCTGCTCAT TGAGTGGCGC GCGCTCATGG TAGAGATGGA CTATTCAGTG AGAGCGCACA 
GGGTGCAGCG TAGTTCTGCT CCGTTGCATG AAAGACACGG CACTCTTCCC TCTGACGTAC 
TGCTCTTCCA AACACAGGGG GAGGTCTGCG CTCTCTGTGC CTTTCAGTTG CAGCACGTGC 
GCGCAGTAGG CGGGCAGCGT CATCTTATCA TTCATGAAGC GGCAGGAGGC GGGAACATTG 
CATGCGAACG CATTTTCTCT CTCAAGGAGA TTGATTTTGC CACTGCAAAA TTTACTGAGC 
GTATCCGTCG CGGGCTGTAC CAGGTTGCTG TGCATACGGC ACACGCAGAC TTTACGGTCA 
ACCTCATCGT TCCCTCACTC AGAGAGCAGG GCGGCGC CT A CTCCCTTGCA GAGTCTTCTG 
CCTTTCACAG GAGGGTAAGC GCGTAAACTG CAGATTACGC AAAACTAGCG GTTCTAGCCC 
GGAGGGtGAA GCGCGCCGTG TGCATCGGCA cGACCCCGCC AGAGAAGGCG GGCGCCCGTC 
TTCCCCGCGG TgGCGsGCTG CGCCCTGCTA CCCCGTCTGC TGTGGAGCGG GGCCTTGTCT 
TTTTCCTGAG GGCTTGTTAC gcTGCGCGCC AGTCCCCGAG GAAGAAGGAA TTGCTATGAG 
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TAGAGGTATT TCTACCTTCA GGAATATCGG CATCAGCGCG CACATAGATT C TGG AAAG AC 
AACCCTTTCT GAGCGCATTC TCTTTTACTG TGATCGTATT CACGCCATCC ATGAGGTGCG 
TGGTAAAGAC GGTGTTGGCG CCACCATGGA CAACATGGAG CTTGAGCGGG aGcGCGGTAT 
TACCATCCAG TCTGCCTCCA CCCAGGTCCA GTGGAAGGGA CACACTATAA ACGTCATTGA 
CACTCCCGGG CACGTTGACT TCACCATCGA GGTGGAGCGC TCCTTGCGCG TTTTAGACGG 
TGCCGTCCTC GTACTCTGTT CGGTTGCAGG CGTCCAGTCC CAGTCCATCA CTGTCGACCG 
GCaGcTCCGC CGCTATCaCG TGCCCCGTAT CTCATTTATC AATAAGTGTG ATCGTACGGG 
TGCCAACCCt TTCAAGGTCT GCGCTCAGct GCGCGAAAAG CTCTCCCTTA ACGCGCATCT 
TATGCAGTTA CCCATTGGGC TTGAAGACCG TCTAGAGGGT GTCATCGATT TAATTTCGCT 
CAAAGCCCTT TATTTCGAGG GAGAAAGTGG CGCGCACGTG CGTGAGGCGC CCATTCCCGA 
AC AGT ATC AG GCAGATGTGA AAAAGTACCG GGATGAACTC ATCGATGCGG CGTCTTTGTT 
TTCTGACGAG CTTGCTGAGG CCTACCTTGA AGGAACTGAG ACCGATCAAT TGATTCGAGC 
GGCAGTACGT GCGGGCACCA TTGCAGAAAA GTTTGTCCCG GTTTTTTGCG GTTCTGCGTA 
CAAAAATAAA GGTATTCAGC CACTTTTGGA CGCTATCACA TACT AC C TGC C AG ATC C T AC 
CGAGGTAACT AATACCGCGC TCGATCTGGA TAGAGCCGAG GAGCCAGTTA CCCTCTCCAC 
CGATGCAGAC GC AC CGGT AG TTGCGCTCGG GTTTAAACTA GAGGATGGCA AATACGGCCA 
ACTCACCTAT GTGCGTGTAT ATCAGGGGAC TATCAAAAAA GGGGCTGAGC TTTTTAACGT 
CCGCGCGCGC AAGAAATTCA AGGTGGGCCG TTTGGTACGG ATGAACTCTA ACCAGATGGA 
AGAC ATC AG T GAGGGAACCC CCGGAGACAT TGTGGCGCTT TTCGGCGTGG ACTGCGCGTC 
GGGAGACACC TTTTGCAGTG GAGATCTGAA TTACGCAATG ACTTCGATGT TTGTTCCAGA 
GCCGGTCATC TCGCTTTCCA TCACTCCTAA GGACAAGCGG TCCGCTGACC AAGTTTCCAA 
GGCGCTGAAC CGGTTCACCA AGGAAGATCC TACCTTCCGC AGCTTCGTAG ATCCTGAGTC 
TAACCAAACT ATCATCCAGG GGATGGGGGA GTTGCACCTG GATGTGTACA TTGAGCGCAT 
GCGACGCGAG TATAAGTGTG AGGTGGAGAC GGGCATGCCG CAGtGGCGTA TCGGGAGGCA 
ATTAGTGCGC GCGCGGATTT TAACTACACC CACAAAAAGC AAACCGGCGG TTCCGGGCAG 
TTCGGGCGTG TGGCCGGCTT TATAGAGCCC ATCGCCGGGC AGGACTATGA GTTTGTAGAT 
CAAATCAAGG GAGGAGTAAT CCCAAATGAG TTTATCCCTT CGTGTGACAA AGGCTTTCGC 
ACAGCGGTaA AGAAAGGAAC TCTTATTGGT TTTCCGATTG TGGGGGTGCG CGTTACCATT 
AACGATGGGC AGTCTCACCC GGTTGACTCC TCAGACATGG CGTTCCAGGC GGCAGCGATT 
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GGTGCCTTTC GTGAAGCGTA CAATGGGGCA AAGCCAGTAG TCTTAGAGCC AATCATGCGA 
GTGTCGGTGG AAGGGCCCCA GGAGTTCCAA GGCAGTGTCT TTGGGTTAAT TAACCAGCGG 
CGGGGAGTGG TTGTATCGTC AGCGGACGAT GAACAATTTT CCCGCGTGGA CGCGgAGGTC 
CCGCTGAGCG AGATGTTCGG GTTCTCCACC GTGCTACGTT CTTCCACACA AGGTAAGGCT 
GAGTATTCTA TGGAGTTTGC TAAATACGGC AAGGCACCGC AAGGTGTGaC GGACTCGCTC 
ATAAAGGAAT ACCAAGAGAA ACGAAAAGCA GAACAAAGGT AAGCGTAACG TGCTAGGCGG 
CGCGTCCTTC TCGACGCGGT GGCGAAGTCT TGAATAAGGG GGCTTTCTGG TGTAcCCTCC 
CGGGCCGAAC GGTACTCTCC TCACATGAGC CGAGGAGGTA TCACGTGGGA GGTTAACATC 
ATGAATGCTC ATACGCTTGT GTACTCCGGC GTAGCACTTG CCTGCGCGGC TATGCTCGGC 
TCCTGTGCCT CGGGCGCCAA GGAGGAAGCT GAAAAGAAGG c t GC AGAGGA GCGTGCGCTT 
CTGGTCGAGA GTGCGCATGC TGACCGTAGG CTTATGGAGG CGCGTATCGG CGCGCAAGAG 
TCTGGCGCAG ACACCCAGCA CCCCGAACTT TTCTCCCAGA TTCAGGACGT TGAGCGCCAG 
TCTACCGACG CCAAGATTGA AGGGGACCTC AAGAAAGCTG CCGGTGTCGC CTCAGAAGCT 
GCGGATAAGT ACGAGATTCT CAGGAACCGA GTTGAAGTTG CTGACCTACA ATCTAAGATC 
CAGACTCACC AGctTGCGCA GTACGACGGG GACAGCGCGA ACGCTGCGGA AGAATCGTGG 
AAGAAGGCAC TTGAATTATA CGAGACCGAT AGCGCGCAGT GTCTGCAATC CACCGTCGAA 
GCGC TCGAGT CGTATCGGAA AGTCGCGCAT GAGGGATTCG GCCGCTTACT ACCCGATATG 
AAGGCACGTG CGGGTGCTGC AAAGACGGAC GTTGGCGGTC TTAAGGTAGC CGTCGAGTTG 
CGTCCACAGC TGGAAGAAGC TGACAGCCAA TACCAAGAAG CACGTGAAGC TGAAGAGGTA 
AATGCACGTG CCAAAGCTTT TAGCGGGTAC CACCGTGCCC TCGAGATCTA CACAGAACTG 
GGGAAGGTTG TACGCCTGAA GAAGACCGAG GCGGAAAAGG CGCTGCaGTC TGCAAAAACA 
AAGCAAAAGG CGTCCTCTGA CCTTGCGCGG AGTGCGGATA AGAGTGCCCC aCTTCCtGAA 
AACGCTCAGG GTTTCTCAAA GGAGCCGATT GAGGTAGAGC CGCTTCCAAA CGACAGGCTT 
AACACAACGC AGGCAGATGA GTCTGCGCCG ATC C C CAT AT CTGACACCTC TTCACCTTCT 
CGCGTGCAGT CTCGGGGTGT TGAAGACGGA GGACGTTCTC CAAAATCCTC TATGAACGAA 
GAAGGAGCCT CTCGATGAAG ACACGTAATT TCTCGCTCGT ATCCGCGTTG TACGTACTGC 
TGGGTGTTCC TCTGTTTGTG TCTGCCGCTT CCTACGACGA CAATGAATTT TCTCGCAAGA 
GTCGTGCGTA CTCGGAGCTT GCAGAGAAGA CATACGATGC GGGAGAGTAT GACGTCTCTG 
CAGAGTACGC CCGGCTCGCT GAGGATTTTG CGCAAAAATC CTCGGTCTAC ATCAAGGAAA 
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14580 

14640 
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CTATGGCGCG 

CGAAAAATGA 

TCAAGACCGG 

TAAGGCGTTG 

GAAGGAGGAg 

AGCCCAGAAA 

CGAGGCACGA 

GCGTGCGGAG 

AAC TC GAG AA 

GGGAATGTTT 

AGAAGTTGTA 

AGCCTGAGAC 

AGCCCAACGT 

CCCCTTTTCC 

GAGCGCTGGA 

TCCACTTTTT 

AACAATgCTT 

TGCCATTCCC 

TGGAAAAACG 

CCAGGGGCCT 

GCTCCCTGAG 

CGGAGCAGCT 

TCATCGATCA 

ACGAAGCGGA 

ATGCAAATAA 

CTATTGCCTC 

CGCGCCCGCT 

TTGTGCGCCT 

CGGACGCCGA 



CACCACTGCC 

GCGCATCGAT 

AGGcTCGCTT 

GACGCACTCA 

GCTGCGCGCA 

GCGCAGGAAG 

AAACTCGAAG 

GAGGAAGCTG 

GGGTCGTGTG 

CTGGAATATT 

TGAGGCGAAC 

AGTCCTGGTC 

GAAATACCGT 

ACAAGGGTGC 

ACGCTACGGT 

GCTACACTTC 

GCAGCGGTGC 

CGACTGTTGG 

GyCCCTTCGG 

TAGTGCTTGT 

GATACAAAAA 

GCG TAATCTC 

TATTGAGCGC 

TGAGATGCTA 

AGACGCACGC 

TACCTTCATG 

CATTGAACAA 

TATTGATGTG 

cACTGTTGCG 



GAGGACGCTA 

CGCGCCTATC 

TTGACAGCAA 

AAAACGTAAA 

AGGCCGCCGA 

AACGTAAGCG 

AACAAAGAAT 

CGCGCAAGGC 

CTACCTGCGC 

GCCAAAAACC 

AAGGACAAAA 

ATCCCCAGTC 

CCTCTGCCGT 

AAGGGGCGTG 

GTCCaGCGCT 

CCTTCTATGG 

GACTCAAGGG 

CAGGGGATGC 

CCTCCCCCTT 

TCCTACAAGG 

ATACCTCGGA 

GAACAGGGTG 

GGTTCTC TCG 

AACATGGGTT 

GTCCTTATGT 

GGAAGCTACG 

TTTATGTGGG 

AGCGACAACT 

AAATCTCTAG 



745 

TGAACgcTGC GCGCACCCGC 
CGACCGAGTA TTTGCTCGCT 
GCAGTACGAC GTAGCGCTCA 
GCCTGAAAGT CAGTTGCTTG 
GGCACGAAAA CTCGAAGAAC 
TGCGGAGGAG GAAGCTGCGC 
TGCAGCCCAG AAAGCGCAGG 
CGCCGAGGAA GCAGCGCGAA 
AATACAAGGT GACTACGTGG 
CCGCCGTTTA TGGCAACCCC 
TTCCTCAGTC CAAAAACCCC 
TCAAGGGAGA GGAGCGCGAG 
AACGGATAGA CAAGAGCGTA 
GTTGGGAGCC CATAGAGAAA 
CTTTGTGTGT TTTTGTCTCT 
ACGTGTCCTT TGAAGAGCTT 
GTTTCGGTGC CCAACTCCCA 
GAATATCATC GCAAAAGCCC 
ATCCAAGAAC TGGGAAGCCC 
GAGCTCGTGC GCAGGTCGCA 
TTCACACCGT GTACGGTGGG 
GAGAGATAAT AGTAGGAACG 
AGCTGTCTTA TCTGCGCTAC 
TCGTTGAGGA TATAGAG TCT 
TTTCTGCCAC TATGCCCAGG 
AGGTTGTTGA AGAAGTCACT 
TTGTAAGGGA CGC TGACAAA 
TTTACGGTCT GGTGTTCTGT 
ACGAACGCCA TTACCATGTT 



CACGCGTGGG 
AGCGAGGCTA 
CGTGGGCGCG 
CAAAGGCCGC 
AAAGAATTGC 
GCAAGGCCGC 
AAGAACGTAA 
AaGGCGGAGG 
TCCATTGACC 
TTCCTCTGGA 
AATTGGGTAG 
GGTCTGTATG 
TACGCTTTTT 
GaGCTyCCCA 
ACAAGAAAGT 
GGTTTGAATG 
TCCAGGCTGC 
GAACCGGGAC 
GTGCGAACAC 
AGCGAACTGA 
GTCTCCATCG 
ACCGGGCGCG 
TTCATATTAG 
ATCTTCTCTC 
CAGATCCTTT 
CCAGAAGAGG 
ATCGAgGCGC 
CAAACCAAGG 
GCTGCACTTC 



3041 

14700 
14760 
14820 
14880 
14940 
15000 
15060 
15120 
15180 
15240 
15300 
15360 
15420 
15480 
15540 
15600 
15660 
15720 
15780 
15840 
15900 
15960 
16020 
16080 
16140 
16200 
16260 
16320 
16380 
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ACGGAGATAT TCCGCAAAGC CAGCGAGAAA AAATTCTCGA GCGCTTTCGT ACAAAACGAG 
CGCGTATCCT CGTCGCCACT GATGTTGCCG CTCGCGGCAT TGACATCGAA GGAATTACGC 
ACGTGGTGAA CTACTC C ATT CCTCATGATA GCGCTACTTA CACGCACCGC GTcGGcAGAA 
CTGGACGCGC AGGATCACAG GGTATCGCTA TCAGTTTTGT ACGCCCACAC GAGACACGAC 
GGATGGAGTA TCTGAGTAAA CACTGTAATG GCGAATTGAA AGCTAGTACG GTACCTTTGG 
TGGAGCACAT CCTTACTCAA AAGGAGGGGC GTATTTTCTC GTCCCTCAAG ACTCATCTTT 
GCCAATTACT CTCTGAAGGG GTGCACGGAA CCTTTACCCG TTTTGCGCAc GGCTGCTCCA 
AGAAGACCTT AAAGCTCGCG TGGCAGAAGC CCTGGGTaCT TCCGCCGACG TTCCTCAGGA 
ACCGAACGTG TCGCTTGTCG CCGCGCTCCT GCAAATCCAC TACGGTACTG CGCTGGACCC 
CAGGCAkTAC CGGGATATTA AAACGATTAC GCCAGAGACG GCCCGCGCAC GTCCCCATGA 
mGCGGAAAAG GCGTATGTGC GCATTGAGTA CGGAAAAAAA AGCTACCTCA CTCGGAAACG 
TGTTGTGCAG TTCATCTGTG CCCTGGTAAA AATCCCCGGT CATC TTGTAG ATCGCGTTGA 
CATAACCGAA CGTTkCGCGT TTGCcGCATa CCCCGACGCG CAGgaGGAAg CAGTTCGCTT 
ATCCAAGAAG CGCAAGGACC TGCCGCGCGT TTCCTTCGTT GGGCACGCCA GTcGCCTAAG 
AAATACCGCT ACCCCTGCAG AAAAGTCTAC CTATCCAAGG CGCCTCCCTT CCGGAGAAGG 
CCTAAGGGAG CAGATCTCAA GGAGAACCTC TTCCTCTAAG AAGGCTTCTG GGAAACCGGA 
GGATTCTCTT CCCCCTCCCC AAGAACATCG CCTTGATTGA TGCAGCGGCT CACTGCGCCA 
CTACAGCATT CGTGCAAGCC AGCGCGAGAT ACTAAGGGCA TAGTTACCGA CGGCTTCTAT 
ACCACGCACG ATGTCCATAT ACAGGAGcTC CGCCTTTACA TCTGCACCCT GCTCAAGACG 
TCTGCGCACA AGTCCtTTTA GATGGGCCCC CTTGC TTTCG ATAGAGTGCG TCATTTGGTT 
TACGTGCAAC ACCTGCTTAT CTTCCaGTGG ACGGTTCaAG TGCGAATACA CAAAGTCAAC 
GCACTCATCC ACCATGCCgA CGTACGGGAC TAACTCCTCG ATATCATCAC GCTTGAGCGG 
TACATTTCCC TTGATGCTCT TATGGAAGTA CAACCCTATA CCACACAAAT GGTCAGTAAT 
ATTTTCAATA TCGTCTGCAA TGGAAAACAT TAATTGCACG TTATGTTTTG CTTTCTCGCT 
CAAAGAAAGA TGCGATGTTT TAATCAGAAA GCGCGAAAGC TGTTCCTGCA TTTGATCTGC 
ATAATCCTCT TCCTTTGTCA GGCGTGTTAC GATCTCATCA GTAGAAAGCA TACACGTTCC 
CTGAATGGAC TTACGGATAG TAACAAGCAT ACCCTGTGCT ATTGAAAACA TTTTTTTCAG 
TTCAATTtCC GCACGAAAAA TATGTGCCTC AGCGCTCTCT TTTACCGCAG TTTCTTGAAA 
AACAAGCTGA TACCTwTCTG GAGCGTCGTC ATACCGAGGA CGAATTAACC ACTCTACAAA 



16440 

16500 

16560 

16620 

16680 

16740 

16800 

16860 

16920 

16980 

17040 

17100 

17160 

17220 

17280 

17340 

17400 

17460 

17520 

17580 

17640 

17700 

17760 

17820 

17880 

17940 

18000 

18060 

18120 
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CGCTGCAAGG TGCTTAGTGA AGGGAAACAC AATAATAGTG TTGACGATGT TAAACATACT 18180 

GTGAAAGAGC GCAAGCCGCA CTGTGATGTT ATCAAAACCC GAATTCTTTG GAGTCAAAAC 18240 

ACACAAGAGT GCCAAAACTG GATGAAAAAA CATCAAAAAA ACCAATGCAC CAAACACaTT 18300 

AAACAGCACG TGGAcTGCGG CAGtCTCCGT GCGTTCAATT TACTCCCAAT GGCTGCAATT 18360 

GCAGCATCAA TGGTAGAGCC CACATTACTT CCTAATACGC TTGCTGCAGC GAACTCCACT 18420 

CCGaTGACAC CACCGAACGC CATAGTCAAC ACGATCGCAG TGGTTGCAGA CGAGGAGTGC 18480 

AAGATGACCG TTAACACAAA GCCTGATAGG AGTCCTACAA AAACACTGAG CGCACGATCC 18540 

TCAACTGCAA TTTTAAGGAA GGAAAGCTCT TCTACAGAAA GTGGAGGAAT GAGCGAAGAG 18600 

AGCAAACCAA GCCCGGTAAA GAGAAGACCA AAGCCCATGA TGCTCTCGCC CAAATGTCCT 18660 

TTATGCAAGT GTTTAAAAAA AGTCAGAAAA TAGCCAATCC CAAAGGCGGG GACAGCGATT 18720 

GACGCAAGCT TAAACTGAAA ACCCACAAGC GCAACAATCC AAGC AG T AAC AGTGGTACCG 18780 

ATATTCGCAC CAAGAATTAC GCCGATTGAC TGCGTCAAAG AAAGCACTCC CGCGTTAATA 18840 

AAAGAAATCG TCATAACCGT CGTAGCCCCT GACGACTGCA CAATAGCGGT AACTGCCATG 18900 

CCGGTTAGCA CCGCGAAGAA ACAGTTACTG GTCATCACTT GGAGAaTTTT GTGGAGGCTT 18960 

TCTCCAGTAC CCTTTTGGAT ACCGTCACTC ATCAGCTTCA TACCAAAGAG CATGAAGCCA 19020 

AGGCTTCCGA TACCCTGCAA AAGGACAGCC ACAAGGTGCA TCGGCGCCCA CCATAGCAAA 19080 

AACAGGGGAT ACGTATCAAT TGTCCGAAGC GGGAC AC TGC GCCGTACGGA CGTATGTTTA 19140 

TTAGTCAATC TCTCTTTTCT CAAATAGTCT CGCCGTGACA TCGCTT 19186 
(2) INFORMATION FOR SEQ ID NO: 122: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4901 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 122: 

TGGTATGTGG TGCGCGATTC ACGTAAACGA AATGTTTCTT TTCGCAGAAC AGATGCTAAA 60 

TAGCTGGCGC ACAGCGATAC ACGCCATAGG GCAACTCTTT TTGCGCACGA GTGCGCACGT 120 

GCCGGTGACA CCCATTCATC TGTTAGATCC TGCTCACTAC CTGGAACGCA TTCCCATCGT 180 

GTTGTCGGTG CACGCGCTCT GTGGCGTTGC AGGAGGCACG CTATGCCTAT CGTTGGCGGT 240 

ATGCATCATT CCTGCGCTCC GTGCGGGCCG CGTGCGTCCC CTTGACCTCA TGCGCAAGGT 300 
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GTGATAGGAT 


GCCCCCAGTT 


TTTTTGATTT 


CTTACATCCG 


GCTCGCAGAC 


GGATGGGTAC 


360 


CATC TTTT AG 


AAGAGCACGC 


GGCCGGTGCG 


GTAGGGGCTA 


TGGAAGAACG 


CAAGAACTAT 


420 


ATGGATCGGC 


TTGCCGAAGC 


ACTTACGCGG 


CGCAAGGTGC 


AGCTCGATCG 


GGACATCCTA 


480 


CCAAAGGCGC 


TGGAGCAGTA 


CCGTGTCCAG 


GTCACAGCGG 


TAAAGGCAAT 


TCGCAGTAAT 


540 


TTGCTGCAAA 


AAGGTTTCTT 


GCACGATGAC 


GCGTACAAGT 


ACGACAGCAA 


GATGACTGAG 


600 


ATTGAGTTGC 


CAGAAACTTc 


CCCtACGGGG 


AAAATGAGAA 


GCCGATGGTC 


ATTGGTTCTC 


660 


GCCTTTCGCA 


CTATCAGACT 


ATGCTCGGTT 


TTTTGGACAA 


CTACTACCGG 


TTCGATTCGG 


720 


AGTTTCTAAT 


CCCGAAACGT 


ATTGCAAAGC 


TCGTTGCGCT 


CAACGGTACg 


TTCATGTGGA 


780 


AAGATTTTAC 


TGCTACCACC 


AAAGACGCGA 


ACACACGTGG 


GCTATTTGAC 


ATAGTGCAAT 


840 


CTTTC TATGG 


CGCTGCTGAT 


CCTATTTCGA 


TAGGACTGGT 


GAGGGATTCG 


TTGCAGTACC 


900 


TAG T AAAAGC 


CCATGAGGTA 


ATCAGCACAG 


CACTTAAGTC 


CCTTTCGGTG 


TTTCATCGCG 


960 


AGCGCTACAA 


ATTGCTTATT 


CGCCAGCATG 


CTCTGGATGG 


CTTGGACGAG 


AC AAC GGTGG 


1020 


ACGTCAACAA 


CCCAGAGGTT 


GCGCTTGACG 


CGATGAAAAA 


GAACTTTTCA 


GAAAATGCAA 


1080 


AAGGCCATCC 


GTTCTACAGC 


GAGTTGGCAA 


CCGTGGTTTT 


GAGAGAAGAC 


TTCTCTGCCA 


1140 


ACGCAGAAAA 


GCTGCGGGCT 


GCAATCCTCC 


GCGAGTTTGA 


AGAATC TTCT 


GCACCCAAAA 


1200 


GATGcCGCGG 


TGCTATGCGC 


AATCeACACG 


CCGTACTACT 


TTCTGGTTTC 


AGATCGCTTG 


1260 


GAGCTACTTC 


TAGCCACTTT 


CATACTGCTC 


TGGAAAAGAT 


TCGCTTCAAT 


GAGGAGCTCG 


1320 


TGACTCAGTC 


TGAGGCGGCC 


TTCTTTTCAA 


AGGTAGTGTT 


AGCCTTTCTC 


AAAGCTTTCA 


1380 


ACATTCAGAC 


GCGTTCAAAG 


GACGTTGAAA 


TTGTCGTCGT 


CGACCCGGCA 


ACACAGATAC 


1440 


AGAAAAAGGA 


ATGCGTAAAC 


GTTGAGCTCT 


TTCAAAAAGA 


GCTGGCCCGG 


TGTGTCAAAC 


1500 


TGTATCGGGG 


TTTTGTGTCT 


CCAGACACTC 


CGATTCATGA 


AAAGTTAATG 


GCGCTCAAGG 


1560 


ACGAGCAGct 


CTTCGAGCTC 


CTTTTTAAAC 


ACGTAGCAGA 


GGCGCATACG 


CTGGTTAAAC 


1620 


AGCTTGCAGG 


TCTTGATGAG 


TACTACAAGA 


CAGTGAGGTC 


TGATGTGCGC 


GCGAAAATTA 


1680 


AAGGGGTCAA 


GATTGAAGTT 


ACAACTATCA 


CCACTTCTGT 


AACCAAGGCA 


AATAAGTGCC 


1740 


GCGCAGAATA 


TGCCTCGCAA 


CTAGAGGAGC 


AAAAACATAT 


GAAGCGTTTA 


GGGGTAGCCC 


1800 


GTGCGTAGAA 


TGCGGCTCTC 


GCGCCGCGGc 


ATTCTCACGG 


TAGTAGGTAC 


CCTTCTTCTC 


1860 


CTACCTCTCT 


TTCCTTCCGA 


AAAAAAAAAG 


ACTCACGCGC 


CGCTCCCTCG 


ATCTGAAAGA 


1920 


AAAGAGTTTG 


TGGTGTCCTT 


TTCTCCGTAT 


AGGCCTGTGC 


TACACCCGCA 


CGTGGCATCG 


1980 


CGCGTGGACG 


AAGCACAGCT 


GCTCACAGCC 


CTATATGAGG 


GACTTGTCAC 


CTATGATCCG 


2040 
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TACGATCTCC ACCCAATCCC GGCGCTCGCA CAACACTGGT CGGTAAGCAC CGATGGGTTG 2100 

ACGTGGACGT TCTATTTACG AGATCAGATT TTCTTTCAAA ACGGCGACCC TATCACTGCA 2160 

GAGACGTTCC AGCAATCCTG GCTCAATTTG TTAAATCCTG AATGGAATGT GCCGTATGCG 2220 

TCTTTTTTGG ATGCAGTTGA GGGGGCACGT GCGTACCGCA GCGGCACTAC GGCTGACTCT 2280 

CACACGGTTG GGATTCTCGT AGAGGGGTCA GACAAAAAGA CACTCGTGGT CAAGCTCGCG 2340 

TACCCAGCAG GACACTTCAT TCAGATGCTC TGTCACCACG CATTTGCCGC AGTCCACCCC 2400 

ACCCAACTGG CAAGCGTCGG CACGCTGCAC GCGCGTACGG CAAGCGCCTC AGCACACAAG 2460 

CCGTTCCATC CTATCGCAAG CGGTCCTTTT GAATTACAAC AAATGCAAGC AGATCGCGTG 2520 

GTGTTGCGTG TTAACACCCG CTACTGGGAC AGGGaCGCGC TTGCCCTCCA CGCCATCGTG 2580 

GCGCTg cATT GCACAAGACC CTGCAGCGCG CGATGCGGGG TTTAACGATG GGAGCATCCA 2640 

TTGGATTAGT GGAGCGCTGG AGCACAGTTC TTTGCAGGAT GC AGCTAC AC TTCAGATCGT 2700 

ACCGCTTCTG GCAACAGAGT ATCTGTGTTT TAAAACGGCA CATGAGCCGA CGTGCAAgCC 2760 

ACGcTGCGCA AGGCACTGCT TTTAGCTACT CCGGTGGAGG AGCTTACCGC GCGCTATTTA 2820 

TTTCCCGCAC GAACGCTCGT AACTCCGTTT ACCGGCTACC CGGTACCGCC TGTAGTACAT 2880 

GAATACAATC CTGCGCGCGC ACGCTnT t TT AGCAGAAGCG AAGATAGGTG GGAAGACAGC 2940 

CCGTACTCCT CTTAAAATTC TCGTTTCCGA CACCGAGGCG TGCCGGGCAC TCGCACTTGA 3000 

ACTTCAGAAG GCCTGGACAG CCCTCGCACT TGCAGTGGAA AT C TGGGC AG TGCGGCCTGA 3060 

AACGTACCGG GAATATGTGC AGGATGAAAA ATACCACGTG AGAATCGTGT CTTGGGTTGC 3120 

GGACTTTGCA GATCCGATGG CGTTTCTGGA GCTGTTTAGA AAGGGATCAA AG AC AC AC TC 3180 

AACCGGATGG ACCCATGAGG AATTTGAGGC ACTGCTGACA CGCGCAGGAG CAGAACCGCA 3240 

CGTGCTTCGT CGTTGGGAAC TTCTTGCGCA GGCAGAACGT ATCCTCTTAC AGGAAGCAGT 3300 

TGTGCTTCcG CTTTCGCGTT TGCATGCACT GCACGCGGTA CAGCGGCGCA CGGTGCGCGG 3360 

CTGGTATGCA AATGTGCTCG ATGTGCATCC ATTTAAGTTT ATC TCGTT AC AAGAAGAAAT 3420 

AAAGGTCAAC CTAGACTCAT AGAGGGGCTG CAACCCGTGC ACACCCAGGT GTACCTTGCA 3480 

ACGTAGATGT ACCGGCGTGT ACAATGCCCT CTGCATACAC AGAGGGGATT ATGGGGTATC 3540 

CGTTTCGCGC TCTAGAGAAA AAATGGCAGG CCTATTGGCG CGACAAGCgs GTCTTTTGTG 3600 

TGTCCGAGGA TGAGCGCTTC CCTCCTGAGC GGCGTGCGTA CGTGTTGGAC ATGTTTCCCT 3660 

ATCCTTCAGC GCAGGGACTT CACGTCGGAC ATCCAGAAGG CTACACTGCA ACTGATATTT 3720 

ACTGCCGCTA CTTGCGCATG GGTGGTTACA ACGTGCTCCA CCCTATGGGT TTTGATGCCT 3780 
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TCGGACTTCC GGCAGAAAAC TTTGCACTCA AAACTGGTAC TCATCCGCGC GTCTCCACCT 3840 

CCGCCAACTG CGACACCTTT CGCAGACAGA TCCAGTCGTT TGGTTTTTCC TACGATTGGG 3900 

AACGTGAAAT ATCTACCGCA GATCCAGAAT ACTATCGCTG GACTCAGTGG CTGTTCCTCA 3960 

AACTTTATGA AAAAGGATTA GCCTATGAAG CAACCGCGCC CATCAATTGG TGTCCCAGCT 4020 

GCAAAACAGG CCTTGCAAAC GAAGAAGTAA GAGACGCGTG CTGCGAGCGC TGTGGTGCTG 4080 

AGGTGACGCG GCGTGGTGTC CGCCAGTGGA TGGTGCGTAT TACAGCGTAT GCCGAGCGTC 4140 

TCCTTTCAGA TTTAGATGAA CTTGACTGGC CTGAGTCAGT TAAACAAATG CAGCGTAATT 4200 

GGATTGGAAA AAGCTGCGGC GCGGAAATTG ACTTTCCCGT AGATGCGCCT GCGTGTTCAG 4260 

TGCACGATAA GCTACCACAG ACAATTCGCG TGTACACCAC GCGTGCGGAC ACGCTTTTTG 4320 

GAGTAACGTA CCTGGTACTT GCTCCCGAGC ATGAAGCGGT AACGGCGCTC ACTACACACG 4380 

CACAACGCGC AGCGGTACAG GCGTACGTGC AACGTGCAGC AAAAAAGAAC GATCTCGAAC 4440 

GCACTGATTT AGCGAAGGAA AAGACCGGTG TTTTCACCGG CGCGTACGTG CGCAATCCAA 4500 

TCAATGATAT GCGCATACCG GTGTGGGTAG GTGATTATGT GCTCGTTTCc TACGGCACGG 4560 

GGGCAGTGAT GGCAGTTCCT GCACATGATC AGCGCGACTG GGATTTTGCC ACTCGGTTTG 4 620 

GCTTACCCAA GTTAACCGTG GTGTCTGCAG ACTACACTGC AACAGTTCCT AATAGCAACT 4 680 

CCCCTCAAGG CGCGGTACTC CAAAGATGCG TCTCAGACGA GGGTTTTGTC GTCAACTCTG 4740 

GAGCTTTCAA TGGTCTTGCT AGTGCCGACG CGCGAGAACG TATTGTTGCC CATCTTGAAA 4800 

TGCGTGGCGC AGGTGCACGG CGCGTCACCT ATCGCCTACG CGACTGGGTG TTCAGCCGTC 4860 

AGCGC TATTG GGGAGAACCC ATCCCTCTTG TGCACTGTCC T 4901 
(2) INFORMATION FOR SEQ ID NO: 123: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2257 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:' 123: 

CCCACCTGTC TCACCGGCCT GAACGCCGGC GTCGAAGCAC GCGTGATACA TCCCCCTCAC 60 

CTACATCCGT TACAGAAATA ACGGAGGGTA CGAACTGAAT GGAGCTGTGC CCCCTGGGAC 120 

TATCAATATG CCAATTTTGG GGAAGGCGTG GTG CAGCTAT CGCATCCCCC TCGGTTCCCA 180 

CGCCTGGCTT GCACCACACA CATCCGTGsT CGGCACAACC AATCGCTTTA ACATTATTAA 240 
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CCCCGCGGGC AACCTGTTGA ATGAACGAGC GCTCCAGTAC CAGGTGGGAC TGACGTTCAG 300 

TCCCTTcGAG AAGGTGGAGC TCAGCGCCCA GTGGGAACAG GGCGTGCTTG CTGACGCTCC 360 

TTACATGGGC ATTGCCGAGA GCATCTGGTC CGAACGCCAC TTCGGCACCC TTGTCTGCGG 420 

AATGAAAGTG AC ATGGTAAA * AACGCGTGCT GTTCGATTCC ACCTCCCCTA TCAAACCCCG 480 

TTTTGCTCGT CTTGCCTTTG CAGTTGCAAA ATTTTGTCTC GGATGAGGGC TGCCTCTTCA 540 

AATCGTAACT CACGCGCACA AACCTTCATG TGTAGGCGCA ACGCCTGTAc CATTTTTTTG 600 

CGTGCAGCAT GTGTGCGCAC GTCTGCGTCT GCTGCGCGCA ACAGGGGTGC GACCTGTACa 660 

cGCGCAGtCT TTTTTTACTT CCTGCTCACG GACCAGAATA TCTTCAATAG ACTTTTTAAT 720 

CGTACGGGGT GTAATCCCAT GAGCACGATT ATACGCCATC TGAATCTTTC TCCGTCGAGC 780 

AGTTTCCTCT ATTGCTTCAC GCATCGCATC GCTGATTGCA TCCGCGTACA TTACCACAGT 840 

TCCGCGAGCA TTACGTGCTG CTCGACCAAT AATTTGGATG AGACTCGTCG TCGAACGTAA 900 

AAAACCGACT ATATTGGCAT CCAAAATAGC AATGAATGCC AC CTCGGGC A AATCAATACC 960 

TTC TCGTAAT AAATTTATTC CAACTAATAC CTCACATTCC CCCGCACGCA GACTCGTGAG 1020 

AATTTCTACG CGTTCAATAG TTTCAATTTC CGAATGAACA TACTTTGTCC TTATTCCCAG 1080 

TCCATTGAAA TAATCTGTTA AATCTTCAGC CATTTTTTTT GTCAATGTTA GCACCAAAtA 1140 

CGTTCGTTCC gCGcACTACA AGCTTTTACC CGc TGaCATA TATCTTCTAT TTGTCCATCC 1200 

GTTTTTCTCA CTTCGATGCA TGGATCTAAA AGTCCAGTGG GACGAATCAG TTGTTCAACT 1260 

ATTTGCACAG ACTGTGTGCG TTCCTTCACC CCAGGAGTTG CAGAAATAAA AACTGCTTGA 1320 

TTTAACAATG CCTCAAATTC CGAATCTTTC AGTGGACGGT TATCTCGTGC ACACGGCAAG 1380 

CGAAAGCCAA AATCGATGAG ATTCTGTTTA CGCACCCGAT CTCCTTCATA CATTGCAC C A 1440 

AGCTGCGGAA GTGTTACGTG ACTTTCATCA ACAAAGAGCA CAAAATCCTT TGGAAAATAA 1500 

TGAAGAAGCG TCACCGGCGG TTCACCAGAT TTTCTACCTG CAATCGGCGC AGAATAATTT 1560 

TCTATACCGT GGCAATACCC CATCTCTCCG AGCATTTCAA GATCGTATTC TGTGCGCGTT 1620 

TTTAAACGTG CCGCTTCTGC AAGCTTATTC TCTTGAGTTA ATTGTACCAA CCGTTCATCG 1680 

AGTTCTTGTC TAATACGGTC CATGGCGCGA GGGATTGCAT CCTCTTTAAG TACAAAATGC 1740 

TTTGCAGGGT AAACGGTAAG TTCTTCAAAT. TCCCTTAGAA CAGCACCGCT TACAGGATGA 1800 

ATGCGACGGA TACGAACAAC TCGATCCCAA TCGCACTCGA TACGATAAAA TTCTTCTAAA 1860 

TACGCAGGGA AAATTTCAAT AACGTCTCCC CGAACTCGGA AGCGACCGCA CTCGAGCACC 1920 

GCGTCGTTAC GCTCGTATTG CAGAGATACA AGTTGCCGCT TGAGATCTTC AAGATCAAGA 1980 
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CACTGGTTGA CTTCCACGTG GATACGCAGA TCACGCCAGG ATTCAGGCAA CCCAAGACCG 2040 

TAAATACACG AAACAGTTGC GACTACAATA ACATCACGAC GTTCCATGAG ACTAAACGTT 2100 

GCAGATAAAC GCATTCTATT TATCTCTGcA TTGATAGAAG CATCTTTCTC AATGTAGAGA 2160 

TCACGAGCAG GGACATACGA TTCAGGCTGA TAATAATCGT AGTACGACAC AAAATACTCC 2220 

ACCGCATTGT CTGGGAAAAA ACCTTTAAAT TCCCCGG 2257 
(2) INFORMATION FOR SEQ ID NO: 124: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 992 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 124: 

TACTCATCGG CGTCGCGCnT TCnTACTCCT TGGGCATTGA CTACTTCAAC TACCCCGTCT 60 

GATACCATGA CCATTACATC CCCAGGACAC AACGTACGCT CATGAATCTG TATAGGGGGC 120 

AGATCCACAA TGCCAATAAC TGGACAATTC GAATGAAGAT GATAC AC TTC GTGCGCCTCT 180 

CCCGCCTGGG TAAAGACAAG GGGACTTTCC ATCGATGCGT TAATGTAGCG AATATTCATA 240 

CCCGCTGTGT CAATTAATCC CAAGAATAAG GTCGTGTACT TATCGTGGAG ATGCATACGC 300 

TTTACTGCCC GGTCCACCGC ATACAAAATC TCAGGAAGAT TCTTTTTGTC TTCCACGATG 3 60 

CGAATCGTAC TGAGCACAAC ACCCATAACT AACGACGCGG CCAAACCTTT GCCAGAAACA 420 

TCTCCAATTA CAAATAAAAA CAGGTGTTCA TCAATTGAAA TAACGTCGTA ATAATCCCCA 480 

GATACATTAA CCAGTGGCTG AT AGAATGC C CCGACGCATA TTTCCTTGGT ATGTGGGAGC 540 

GCCTTAGGCA AAAGTGCGCG CTGTACACGC GCCATCATTG CCCATTCCTG GGATACATGG 600 

GAGTACAACA AC AAAGTGC T CATGTTCCTC TTTCGATTTA AATACTCTTC AAACTCTTTG 660 

AACAAGAGCG ATATAACTTC GCGCTCAACA GCACGGATAA AACGACATAC TATAAAAAGA 720 

CGCAGCTCTC CACTGGAAAG ACATACCCCA CGAGCCCGAC GTCGGTCAGA CATAAGACAC 780 

AGATCATCAT CAAAAAAATA TATACCGTCT GACCAGTGCC ACGTGTAGTC CATAGAAAAC 840 

TTATGAAGCA CAAGATCATA CGTGCGTGTG TCTGAAACAA ATCGTGGCAG CACTGTTGTA 900 

AATAACACGn TTACTTATCG TATCCATTAA GAGCACTGCG CAATCGGAGC GATATTCAAG 960 

CACCTCCTGG AAGGCAGCAA CAAGTTGTTC AT 992 
(2) INFORMATION FOR SEQ ID NO: 125: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2291 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 125: 
GTGCAGGATC CGTCTCTTGT TCACGAAGAA CTCAGACACG GTTGGCGTAT CTGCAGATCG 
CCTATGCTCC ACGGGGATAT ATTAAAAAAC ACGCGCACGC TAGAAaCCtC GCCGCATATA 
AACgGCGGTC AAg A t TTGTC CAATACGCGA TTAAGCACAA TTACACCACT GAGTCAAAAA 
GCAGTATCAG GATAGACAGG GGAACAGGTA TGCCTGATAC GGTAATATTT ACAAAAATAA 
AGGTCATTGA CCAAAAGCTG GCTTC CTTTG TAAGGGAGCT CATGTCTCAC CAGGTAGATC 
AACTCACGGT TTTACAGAAA ACGGAGGAAA TCTACGGTAT GCTCGTGGAC CTACTGATCT 
AGCACTAAGG AAACTTATCA TGATACCTAA ACTGAGCCCA AATGCTGACC CCTGCACAGC 
GTGTGCTTCC ACCCCTGCCT TTCCGCCGCA GCCCATGGAG GAGGAAGGAG AGTTT TT AT A 
TCAGTTGCGT CTTGAGTACT CCCGCGAAGT GTTATATGnC GTTTTCACGC GGnTACGCGT 
GCACACTTTT GnTnGGTACC CACGCGTTAT GGCAACGACA TTGCTCGGTG CGTAGGGCGT 
ATACGTACAC CGGTACAAAC CGATATCGCG AGTGTGGTAC GTGTTGCATC AGACCAAGAT 
TTGTGCACAT GGCACATACA TAGGGAAAAG GAACGTGCTG CGGAAATGAT TTTTCGGGAT 
CGCATTGAGC ACTATCAACT TGaGATGaAA TGTATTTGCT GTCACTATCC TTTAGAAgAA 
gCACsCsTGG TaTTTCTATA CAgTGCGCCA GCACGTkTTG ATTTCaGAGA ATTAGTTAGA 
GACTTAGGAG CTACATTTGG TACGAGAGTC GAACTGCGAC AGATAAATGA ACGGGAAGAA 
GCGCGGATAG TAGGCGGAAT TGACTGCTGC GGGCGCGCGC TATGTTGTTG CTCAGTGTTC 
AGCAGGTTGC GTCCAGTCTC GGTAAAAATG GTAAAGGAAA AAAATCTATT ATTTCGTTCA 
ACCCAGATGA TGGGTCGTTG CGGACGATTG CGCTGTTGTT TGACGTTTGA GGAATGATCG 
TTACACACGT AGCCTGTGTG GCGCACCTAA GTCGCTGCAC CCACTCGTAA CACCACACTC 
ACATGTGCAG TGTGTTCGCG TGTACAACCG ATGTGAGCGT GGTGGGGTTT GTTAGCAAGG 
GGACGTTGAT CTGTTCGTGC TGCTGAkTGT CTTTGTCCCG TCGTATAGAA CACACTCGCA 
GGTTTCCCTG CCCAATACCA TTACTTAGCC CACCTCCTCT AGGAAATTTT CGTAAATGCA 
GAACGTTTGG TAATACTTGA TTTTTTATAA GTCTCCATTG AATATAGGAA ACGAGTATAC 
CCGTTAAAAG CATAGCATTG CTCGACATTA TCTATTGTTT TACGCAAGAG GATGCAGAAT 
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GAAGAGTCTT GAATATTATC GATCACAGCC AAAAGCAGAT GTGCACACGC ATCTGAATTT 
GAG T ATGAAA TACGAACGAT ATAAGCAATG GTCAGGAGTA GTCATTCCAA ACTTTCCACG 
TAAAATGCGC GGGCTCGACG AAATGCATGA AATTATTGGT GAGTACACGC GTCCTCAGTG 
TAAAACTGCG CAAGACGTGT TGAATTTGTT TACCATGTCC ATAGAGGATG CCATTGCAGA 
CAATGTCGTC GTAATGGAGA CATCAGTTGA TATTGGCTTT ATCACCCATT ATGAAGAAAA 
TTTGGATCAT TTCTTATGTG ATTTAAGCGA TCTGCATCGA CGCTACAAGC GCAATGTTAC 
CCTTCACTTT GAGCTCGGTA TCTCCAAAAT ACGAGAGCGC AGyTnCGTAG AACAGTGGGC 
TGAGCCCATG ATGCGAAgCG GTATCTTTGA AAATATTGAC CTCTACGGTC CAGAGATTTC 
CGAAGGAATC GAAGATTTCA TCTATATTTT TAAACTGGCC GAGAAGTATC ACTTAAAAAA 
GAAAGCCCAC GTAGGCGAGT TCTCTGATGC GCAATCGGTA CGGCACTTTG TCGAAATATT 
TAAC CTGGAC GAAGTCCAAC ATGGCATCGG AGCCGCTACT GACGAGAACG TTTTGCGGTT 
TCTAGCTGAA AGAAAAGTTC GCTGTAACGT ATGTCCAACC AGTAATGTCA TGCTCAACGT 
CGGTGGAATG CCCTAGAAAA ACATCCTATA AAAAAAATGA TGGATGnCAG GGGTC CGTGT 
TGGGTTAGGA ACTGACGATC TTCTCTTTTT TGGAAAAACA AATAGCGAAC AATTGTTTGA 
TATGGTTTCC T 

(2) INFORMATION FOR SEQ ID NO: 126: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2169 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 6: 
CCGCCAGTCC ACCGGTGnCA CCCTTCAACG CCGGTTAGGG AAAGCGTGCG GCACGCCAGC 
GAGCGCAGCG CATGCTGGAG CkCATTCCAA G AATGG AG T A TGCCACGCAC AGGCTCCTGG 
GTACGAGCGT GCATTTCTTG AGCcGCtGTT GACACACGGC CGAGTGTAAc GGAGCACTGT 
GCCTTGTGCT AGTCCCTGAG CGAGAAGGAT GCAGCAGGAG AGGGCAAGGG AGGAGAACAG 
GAAGGAGCAC CGGTAGTAGT TGATGGGGTT AATTTTTTTC CGTTATGATC GCTGCTATAT 
GTTGTCGGTG TTGTCGTATC TTCACGTGTA CTTTGGGCCG TTTCGTCTGT TGCAATCTTA 
TGCGGTGTTG ATGGGGATTG CCCTGTATGC GGGATTCTTT TTTACGTATG GGGTGTTGCC 
CAGTGCGTAT CGCTTTTTGC CCCAAGACAG GGGGCGTGCG TTTGCGCCGT GTGCACAGGA 
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AGCAGCGGGT AAACCCACAG GGGCAGGAGT GATTTTTGTG TCCGTCTTTG TGTTGTTAGT 
GTACCTGCTT ATGCGTCCGA GTTTTGTGCA TGCGCTTATA TTGCTGCTGA CGTGGGGGGT 
GATGCTCACC GGATACTTAG ACGATTGCGC GCAGGTGTGC TGGGGGGAGT ATCGCAAAGG 
CGCGTTGGAC TTTTTGTTTG CGGTGCTGAC AGCAGCGCTG TTGGGTCATT TTTATTTTCA 
CGATCAGGTG TTCTGGTGGT TTCCTTTTTT TTCAGATCCG GTGTTCGTCT CTCCTTTTTT 
ATTTTTTGCC GGTTCGG TGG TGATTTTGTG GATGTCAATT AACGCAACCA ATTGCACAGA 
CGGGGTTGAC GGGCTTTCGG GAGCGTTGGT GTTGATGGCG CTTCTTTCGA TGGGTACGAT 
TTTTTACTTT TTGTTGGGAA ATGTGCGTGC GGCGCAGTAC CTACTGGTGC CGTTTGTAGT 
GGATGGTGCG CAATGGGCAC TGATGAGTTT TGCACTTGCC GGGGCGCTGA TGGGGTATGT 
GTGGCGTAAT GCACACCCTA GTACGGTGTT GATGGGAGAC GCAGgCTCCC GTGCGCTGGG 
GTTTTTCATT GGGGTGTTGG TGTTGATCTC GGGCAATCCA TTTTTGCTGT TGATGACAAG 
CGGTGTTATT TTGGTGAATG GGGGTACGGG GCTTCTAAAA GTGGTGTTGT TGCGTTTTTT 
TCATGTGCGG ATCCTGAGCC GGGTGCGCTT TCCGCTCCAT GATCACATGC GTGAGAATTG 
GCACTGGTCT ACGGCGCAGG TATTGCTGAG GTTTATGATT TTACAGGGAC TGCTCACGAT 
TGGTCTTTTG GGGGTTTTGT TCAAACTGCG GTAGAGGGAG GGCaCCCCTT GCGGGGCACG 
CCGGGCCGAG CGAGGGCGAC GGTGCGGAGT ATCCGGCGCC TTGACGTGCG TTTATTTCTT 
TTGCTAGCCT GCCCCTAATT GCTTTCCGTT TCCGGAATGA TGGTAGAGGA GACAGGGCGG 
AAGGCGTGGG GTGTGTATGG TGCCGGTGAG AAGgTTCATA GCGGTGTGTG CGGTGACGGC 
GTGTGCCGGG CCGTGTTTTT GCGTTCAAGC GTTTATCTCT TCTCGGATCG GGTATGGGCG 
CTTTGGGATA TATGGGAACG AGATAAAGGA CTCCTACTAC AAACATGTTC CGATGACGGG 
ACTAGGGGTT GACGTGGTAA CGTCTTCAGG CGTTGCGATG GTGTTCAATG TGGAGAaTGC 
kTTGACGCaG CTCATGTTTC gCGCGCAGGC GCTGCTGGGG TACGCGTTTG AGGTTGGCAG 
GTTCCGCTTT ACACCTGCCA TTGGCGGCAG TTTCCTTGCG TCGCACGACC ACGCCGCAGG 
GGTGGCTCTG TCGCTTGACT TTCAGTATTT CTTTAATGAT TGGGTCGGGT TGGACCTGAA 
CATAGGCGCG GGGGTGGATG TTCCGGTGAA CAGTAACCTG CGTTACCTGA TGCGGGTGGG 
GACGCCGGAG TTAGCGAAGA TTCTCATCAC GCATACAGTG ACGCATGGAC TGGCTAATCG 
CTGGATATCA GGTCCCCACT GGTGGAATTC TCTTTCTTCG TGGGTCGGGA ATACCGCGGG 
AAAAGTGGCT GGATTTGTAG CGCGTTTGAT AGCAAATTAT CTGCTGAAAG GCTCACAGTA 
CAGCATGTT 
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(2) INFORMATION FOR SEQ ID NO: 127: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 693 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 127: 

TACCACCAGC AAACGCCGTG AGGTCAGTCC TTCCCGGCAA ATTTCAAAAA AGAGCGCAGG 60 

GAGCCTGTAC ACATTGAGTG CCAACACCCA CAGCCCGAGG AAAACATAAC CCACAGCCAT 120 

CACTGGTACT ATCACCGAAC TGACAACACT AATGCGCGCG ACCCCCCCAA AGATAACAAA 180 

CGCAAGGAGA AAAGACAGCA ACACCGCTAG GATCTGGACC CACAGAGAAG AATCCTTTCC 240 

GTAGTAAGAG AGAGTAGAGA CAATATTATA AGCCTGTAGT GCATTGAACC CGTACGCGTA 300 

TGCAAAGACA AGACACAGCG CAAAAAGCAC CCCCATGGAG CGACTTTTCA GACCCATTTC 360 

GATGTAATAG GCGGGACCAC CTCGAAAACC ACACGCAGTA CGCGTTTTGT ATGCTTGGGC 42 0 

GAGCGTACTT TCGACAAAGG CACTTGCAGC GCCAAAAAAG GCACTCACCC ACATCCAAAA 480 

CACTGCCCCT TTTCCTCCAA AGGCGATAGC GTwAGcAACG CCGACAATGT TCCCAGCCCC 540 

CACACGGCTC GCAGTGGAAA TCATAAGCGC TTGAAATGAT GAAACTc CTT TCCCCCTCTT 600 

TTCAGCCAGC GCTGCAAACG CrGGtTCAGA AGACTAAGTT GAACACAGCC AGTCTTTATG 660 

GTaAAAAGAn ACCGCAGACG ACAAGCCAAC CGA 693 



(2) INFORMATION FOR SEQ ID NO: 128: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4835 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
<D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 128: 

CCACTTCACA TCTTTTCATC AAATTACCGC ATAATACGAT CAACCCTCTT TCCAAATACG 60 

GACACCAATT GCCTGCAAAC GCTCCACTaG ACGTTCATAT CCACGCTCAA TTTGATACAC 120 

GTTACGGATT ATACTTACCC CGCGCGCACA ACACGCTGCA ACACCATCGC CATTCCCGCA 180 

CGCACATCAG GAGATACCAG GTCACTCCCA TGCAATGCAC TCGGACCTGA AACAAGCGCC 240 

CGATGCGGGT CACACAGGAT GATACGCGCA CCCATGGTAA TTAACTTGTC CACAAAAAAC 300 
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ATACGCGACT CAAACATTTT CTCATGAATC AATATAACCC CTTCGACTTG AGTCGCCACC 
ACTGTCATAA TGCTAGTCAG ATCCGGCGGA AATGCAGGCC ACGGGCCATC ATC TATTTTA 
GGAATCATAC CGCCAAAATC GTAATTT AC C C GC AG ATC C T GCGACGCaGA GACGCTTACC 
GCGTGCTCTT GCTCGCTCCA AATTACTC C A AGTTTTTTAA ACG CAAAACC TAGTGGACGT 
AGATCACGTA CGTTCACATC CGAAATCGTC AATGCTCCAC GCGTTACTAC CGCAAGCCCA 
ATAAGCGAAC CTACTTCCAT GAAATCAGCC CCAAGTGTAT ATGTGGTACC ATGCAACGCG 
CTCACCCCCT CAATTGTTAA AACATTCGAT CCGATGCCAG AGACGCGCGC ACCCATCGCA 
TTCAACAAAT GGCATAGATC TTGCACATGC GGTTCGCTCG CCGcGTTCGT GATTACCGTA 
ACCCCCTCGG CAAGAACTGA AGCCATAAGC ACATTCTCTG TCGCCGTCAC AGATGCCTCG 
TCTAAGAAGA CATCACATCC CACCAGCTTA TTTGCAGAGA AAGTAAACAC CCCATCTAAA 
CGAACTTGTG CGCCAAGTGC GGCAAgCGCA AGAAAATGCG TATCAAGTCT CCTGCGCCCA 
ATAACATCAC CACCAGGTGt GGGAAGCACT GCTTTTCTAC CACGCGCAAG AAGTGGCCCT 
GCAAAAAGAA TGGAAGCACG CACTTTCTGC GCAgCTtCGC ACGGCACTTC GCACGTCTGT 
AACTGAGGAA GATGTAACAT GTACTCATGA TTACCACGCC TTTCAACACT CCCACCAAAC 
GCACGAAAGA TAGTTAACAT CACCGCCACA TCCTCAATGT CGGGCACATT TTGCAGTAAC 
ACTGGCTCTT GTGTGAGAAC TGCCGCCGCA ATACAGGGAA GCGCTGCATT CTTATTTCCA 
CACACGCGAA TACAACCTGA AACGGGAAAC CCACCTTCAA CACGATAGCA ACTCATGCTC 
TCACCCCTTT GCGCACACAT TCCTGTAGAA CACCGATCAA CATACAACTT AATTGCACGA 
ATACCGCAAA AGTACGGCCC CATCCTGTGT ACTTACTTAG GCGTCAGCaA ACGCTTGACC 
TACATGCTCG AGTGTCCACG AGGGCAGACA CAACATACCG ATCGAGTACA CGGCGTATCT 
TAAAACGCAA TCCTTCAATT ACCAACTCAT CCCCTACCGT AGGaGACGTC CAAAACGTTC 
CAGGAGTAAA CCTCCCACCG TGTGGTACAc GCGCGACTGC AAACGCGTAC CTAAC AC TTC 
GTTAAAATAT TCAAGAGGAA CCGTTCCTGA ACATAAAATC TCATGAGTAC CGACGCTTTT 
CATCTGAAAA TCAACACGAG GTTTTGACAT TCGCTGcACC ATTGACCCAT GCACTGTGTA 
TCCTTCTACT CCTTCCGAAA CTCCAGCAAG TGAACCAAAA ATTACCTCCA TAATGTCTGT 
CATGGTTACT AATCCTTCCC CATCGCCCCG CTCGTCTACC ACTAACGCCA TCTGcTGCTT 
TGATACCGAG AGCATATCTA ACACTGAAAA AAGGTCAGCT ACATTCGGTA CACAACACAA 
CTCCTGTGCC AAATCACCCA CCCGATGCAC CGCAGCTCTG GTGTTCGCTA CACATGCCGA 
CACCATCTGA TCTGATGACT GCGAGGGTAA TACTTTACTT TCTCCAAAAA TATCCCAGTA 
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GTGTACGAAA CCCCAGACGA TCCCCGCAGC ATCATGCACG AGCAACTGAG AAAAGGAACA 
GGTACGAAAA GCATCCCACA CATGAGATAG TGGTGTGTCC TGAAAAACTG ACACAAAGTG 
TGTGTGCGGT ACCATAACGT GTGCAAGCGG TATGCGTTTA AACTGTAAAG CGCGCTGGAA 
CAATTGATTT TCGGTTGACG AAATCACCCC TTCTCGTGCC CCAACTGCAA TAAGCGTTTT 
AATTTCTTCT CGCGAAAGAC ACGTCGTATG ACGCGGcAAA AAAATACCCT CCAGCACATG 
CATCAATGCT GACGACACAC ACGCGCAGGG GTACAGCAAC CAGTAACTCA ATTGCAAAAA 
AGGCGCAATC CACATCAAGA ATCCCAGTGA GTACCGTGCA CCCAGCGCCT TCGGGAACAT 
TTCTCCACAA AGAATAATCA CGCACGTCAC CGCAACCAGT GCCTTCCACA CCGACTGTGC 
ACCCCACAAC TCCATAGAGC CTAACGTCAC CACGCTAGAG AGCACCATAT TCAGTGCAGT 
GTTTTGCACA ATAACTGTGG TAATCAGCTG TTCGCGCCGG GCCAGAAGCC AGCATAAGCG 
TTGTGTACAA CGTGTACTGT GCCgCTTAAG CTTACGTTCA TCGTCTTGGT TCACCGACGA 
CAACGCGCTT TCTGAACCTG CACATAGGGC CGAGCACACA AGTAAGAACA TGAGTTCCAA 
TCCTACCAGA AAAGGGTATG CCACGCGCGC TTCCCTACT A TCCTTGAAAC TCAAACCGAA 
CACGTACGAT TCTGCGCACA TGCAGTTGTA GCACCACACA ACGCCAAGAA CCGAATACAA 
CCGTGGTGCC AGGATCAGGA ATACACCCTG TATACTCCAT GATAAGACCT GCAAGGgTCT 
CACTGGTACA CGAAGAAAAA TCGGTGCCGA GTAGATCATT TATCTCATCC AAGCGCAgcG 
ATCCAGGAAA TATATACGCC CGTACCCCCG CACGCGTAAC cTGCGGACCT GTTGAATTCA 
CAGGGAATTC ATGCGCACTG CTCTTAAAAA ATGCTTGGTA TATATTGTGC TTTGTCACAA 
GACCTGCCGT CCCGCCATAT TCATCAAGCA CAATGGCGAC TGCGCGTGAG TGTGCGCGCA 
ATTTGTGCTG CACATATGCA AGTCGTGTAC ATTCGAAGAC AAAAACCGGC GCGCTCACAT 
GTTGCATCAG TGTTCCGCAC TCTTCTAAAT CTCGTCCGTC TACCTCTTCT GAGCACAAGA 
ATTTCTTCAC ATCGAAAATA CCAATTATCC AATCAACACT CCGTTCATAC ACTGGAACAC 
GCGAAACGCG CATCTTCTGT GCACAGGCAA TTGCCTCCGC CAGAGAACTC GCGCGCGGAA 
CTGCAATCAA TTGCGCACGA CAGGTCATAA TATCTCGCGC AGTAAGGGAT GCAGAATGCA 
AAATACGTTG ATACAATGCG CGTTCGCGGG AAGTCACAGT GCCATCCGCC TCTCCAGCGT 
ACAGTACGGT GTGCAGGTCG TCATCCGTAA CACGCAGCGA GGGAGTGTGG CACGCGACAC 
GCGCAAGACG CAAGAGCGCA CTCCGCGCCA TACAGAACAC CTGTACAAAA GGAGTAAGCA 
TCAAAGCGCT CCACTGCAAG AATCGCGCAG TATGCAGTGC CAtGCGTTCG GCCGGCACAA 
GGCAAGTGAC TTCGGAATAA TTTCTCCAAA AAGAAGTGTA AGCACCGTTC CTGCACCGAT 
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041 

GCTCCACCCC ACTGCGTGGA TGCCAAAGAG GGCACGTGCA AAAAGCGCAA TGACTGCAGA 3840 

CAACGCACTG CTCGCCAGGG TGTTCCCGAT AACCACAGCA GCAAGATAGA AGTTTTTCCG 3900 

TCGAAGGATA CGCATTGCCA CTCGAGCGCG AgCATGACGT TTTTCGTACA GGTAGCGAAG 3960 

TCTTAAGGTA TTTAGCGCAC AGAACGCTGT TTCCGCAGCA GAGAACAACA TGGAAAGCAC 4020 

CAGCAGCACT ACCAACACAC CGAACGCAGC GGAAACGGAA AG AAC AC TC A CACGTAATTC 4080 

CCCCGAAGGc tAAAACACCA GAACCGAAGA GACAACGCCA TACATCCTTG GACCCCTCCC 4140 

CGCTGGGGGG GGGCACCTTT TAAGGTGCTC ACGCCCTTGT GTCAAGAGCA CACCCTCCAC 4200 

TACAATGAAC TGCGTGTCCG GAGACCGCGC GGAGTCCTCT TTCTATGAAT AGAACCGAAT 4260 

CTCC TCGTGG CTTAATCAAA GCCACCGTAC GTGAACAAGA CCGAGGCCGA ACCGTTTATA 4320 

AAAAGATTGC CCAGTTCCTC TCCCTCATTG GAGAAGAGCA GGCGGCGCTG GTGCTCAAGC 4380 

AACTTGAGCC TGCACAGATT GAGGCGGTGG TTGCCGAGCT CCTGACACTC AAACCCCTCA 4440 

GTCCAGAAGA AGCGCGTGAG AT cTACGGG A GTTTTCTGCC CTCTGCGCTC GTGTGTCGCC 4500 

TGTTACCGGT GGATGCGTnT GCGCAGTCGA TGCTTTCCAA AGCGTTTGGG GAAGAAAAGG 4560 

CCGATCTTAT CTTGAAGCGG GCGGTGCCAG CGGCACAGCC GAAACCTTTT GAGTTTTTGG 4620 

TGCGCTTGAa GCCTCCCAAC TTCTCCCCCT CCTGGAAGGA GAACTACCTG CCACCAAAAC 4680 

ACTCATCCTC TCGCAGCTGC CTCCAGAAAG CtGCGCACTA TTTGAGTAAT ATCAGcACAG 4740 

AGGAGAAGAA GGACTTGATC GTTCGCCTTG CAAAGTTAAA GCACGTTAAC CCTCAGGTGC 4800 

TGCAAGTCAT GAGTGACTCC TTGCACAAAA AGTTT 4835 

(2) INFORMATION FOR SEQ ID NO: 12 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2355 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 129: 

GGCTTTCCCT GCGTTATAGG AGGTAACGAT TGCACCGCCT GTTCAAATTT TTCCCACACA 60 

CCTGGTGGTG ACTGCGATCG CATCTGTGCG GCGAGTTCAT TGCGTGTCTT CGCAGCAGCA 120 

ACAAGTAACC CAAGCGCGTG CATGAGCGCG GTGTCAAAAC GATTACAACC ACGCTGTGCC 180 

GCAAGCGTTT CAGCGAGGGA CTCCCTCGCG GCACCCTTCT TGTACGGTGT CCCAGCATCG 240 

AAGGCACACA CGATCTTGAA TCCTGCTCCC GGGGAAAGCG TCAAACGCGC GCCTACATTC 300 
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CACAGCACAG TATCTTTGTT TTGATGATGC GTCTGATCAG GTCCAGTTCT GTACCCTGcA 
GAAAGAGTAG CACAACTTGc TGcCTCAAGC CTGaCCTGTT CCCGCCCCTT TCGGGCGTAT 
ACGAGCGTGA ACTCAGTACC AAGGCCATAT TTACTGTGCG GGTCCGCTGG CATGCGCCGA 
CTATTGGTGC CCTTTCCTTT CACCTTAGAG GTGACAGAAG GAGCACTTTT CCATATACCG 
TTCGACGAAA AGGAAAGGGC ACAGTTGAAT GTAATGCCAC TGTTCGAAAT ATTCCGTGCC 
TGATATGCGA TTTTTCCGCC AACCCCACCA AACCCTGGAG CGTATTTGAC GCTCCTTGAC 
TCATAACTAG TAGTAATAAA GGGGGTCCAC AACTGCGCAA AATTAGAGGG AAAAACGGGA 
TCTTTTCCTA CAGAAAAAGA GACATCGTAG AGGTGGAGTG TGGCCTCAAG GGAAAAATCG 
CTTCTTCCTG ACTTTAAGAA AGGAGAACGC GTCGTCATAC TTGGATCCGC AGTTCCCGAC 
CCTAAAGCAC TTTCAAAATC CACCTTCAAT CCCTTGAGAG AAAGCTCAAC CCATATGGGA 
TCCTCACCTG AAAAGCTCGT ATACGTGGCG CCTTTCTTGG GCAACAAGGG AAAAGCAAGC 
TTCCAGCTGC TCTTCGTGCG AAACCCATGT CGTATGCTTT TACCCGCTGT AACTGGAGAG 
GCACCTTCTG CATCAAAGAC GACACCCCAG CTAAGTTCAG CAGAAcCGCT GAACGCGGCG 
GAGGACGAAA AGGCGCGTGA AACCCCGTCC AAAACCGCAC CATGAAGGAA AAACACCCCA 
CACCCTAACG CAAAACGCGC ATATACGGAT ATACAGGCGC CCATAGGGCA GC C ATTATGA 
CCCTTTCATA AACACGATCA ACATTTTTTC CCGTCAGCGG TAATTTTTTT CACTACTTCA 
CTCAGTTTTC GCACAAATGG CACAACACCA CGCCCCGTTC CTTCTATACG CCTCCCCAGA 
AAGTGACTCT TCTTCCCACC CACGAATATC TAATGTACAC CAACATACAT TACAGATACT 
GCTAGATCTG ACACATGACA TCGTTGTACT AAACTGTGGT GAAAACACGG T AC AC T ACTG 
TATGCATGCG TCTAGTGTTA GGCTCGTGCA TTT TT AT AC T TTTACTCCGA GGGCGCGCTA 
CCGTATCGCG TCTGCACGCG AGCCCGQCCG TCACCATTTC GGGGAGTACT CGTCTTACTT 
GGGGCATTAA CTTAGGCGCG AAGGCGAACT TCGTGCTACC CGTAGCACCG CTTGGGGCAA 
CCGGCACTGT GCGAGAGAAC CCCAATCATC GCTTCCGTCA TCGCAG AC AC GGTTTTAGGA 
GTTCCAGTAC TCTCTTTTTC TCGCTGACGC TTTGTCCACC . G AAAACTCGG TCGAATCTGC 
ATAAAAGCAG CGGTGTGTAT GCAGAAATCC TGTTAAGGAA CCTAGAGTGT GCGCTCCCCC 
TCGGTTCCTT ATCTGGTGAG GCTTTAGGCG AACTCACGCC CACAGAAAAA CAAAGCTTCT 
CCGTAGAAGC GACCCTTCGC TTCTACGGCG CATATCTCAC TATTGGAAAA AATCCGACCT 
TTTCTAAAAA TTTTGCCAAA TTGTGGCCCC CGTTCATCAC CACACGATAC AAGGAAGCAG 
ACACCCAATA CGCCCCTGGC TTTGGGGGTT ATGGAGGGAA GATTGGTTAC CGCGTAGAAG 
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ACGTCGGGAA TTCCGGGCTA GGTTTTGACT TTGGGTTCCT TTCCTTCGCT TCAAACGGCG 

ACTGGAGCAC GAGCGGGACT AGCCATAGCA AATATGGGTT TGGTAGTGAC CTCTCTATGG 

TACAAGAGAA ACAAGAAGCT GTTTTTAACT GTGGAACTCG CCGGTAATGC TACCCTCCAG 

GAGGGTTATG CCACGTTAGC TCCAACATTT TCGGGAGCAC CCAACAACAA ACGGGCATCC 

CACGCGCTCT TATGGAGTGT GGGAGGGCGT CTTTCGATCA TGCCTGGTGC AGGATTCCGC 
TTCATTTTAG CTACG 

(2) INFORMATION FOR SEQ ID NO: 130: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 718 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 130: 

CGGGCGCGTC GAGTAGTCGG GTGCAGGAGG GATCTGCAGC GAATGACCAT TTTCAGGACG 

AATCGACCAC GGCGAAACTC CTTTAGTCAG TGGCCGCCAG CGCGTGGAAA TAGATGTGGA 

TGGATGAGTC TCCCGTCCCT GTGCAAGAGT TTAAACTTTC TCAGTTTGAG GGTCCTTTGG 

ACcTCCTGCT GTTTCTCATC AAAAAGAACG AGCTGAGCAT TTACGATATT CCTATTTGCG 

AAATTACTGC TCAGTATCTG CAGTACGTGG ATCAAACCGT CTCGCCCGAT CTCCGTGGTC 

TGACGGAgTT TTACGCAATG GCTGCGGTTC TTCTGTACAT TAAAAGCTGC ATGCTCCTCC 

CAATGGAACT AGATCTAGAT GGTGAGGATA TCGAGGATCC TCGGCAGTCG CTGGTGGAGC 

ACCTTATCGA ATATCAAAAA TACAAGCAAC TTTGCAAGCT GATGGAGCTG TATGAGTGTG 

AAGACATGTG GTGCGTTGAG CGAAAAAAGA CGCAGCATCT GTTTTTGTCT CCAGCAGAAG 

TGCCTCTCCT ACACGGTGAC GTTCGTGATT TGCTGATGCT CTTTATTCGG TTAGTGAGAA 

AGACGCCTCA GTGGATTATG GATTTGTACG AAGAAGTTTC GGTAAATGAG AAOCTGACAT 

TGCTTTCGGA ATTGCTTGGG GTTCGGGGGC GGTGTGTATT TACTGAGCTT ATTAAGCA 
(2) INFORMATION FOR SEQ ID NO : 131: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1959 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 131: 
CAACTGTGCA CCGCCGGnTA nAATAAGTGG ACTCGCACAA CGCCAACAGT GTGGATATGC 
GTGCGAAATC TGTTCGCGTC TAAATAACGC TCCCTGTTTC TGCACACGTG CAATAATAGC 
CTTATCAGCA GCTTTTACAA AAAGCCCCTG ATAATCTGCA ACCTCTGCGG TAAAGCGACA 
CTCTGCATCA AGAGGACACT GAATCGAAAT ACCCGCATCC TTAAATACTT CATAATCATC 
CTCTCCGAAC GCAGGTGCGA CGTGTACAAC ACCAGTGCCG TCTTCAGTAG AAACAAAATC 
CGCAACTCGC GTACAGAACA ATCCCTCCTC TGAATCTCCT TGTACAGAAG GATCCGGTCC 
CTGCCCAAAC ACAGGATAAG AAAAAAGCGG CCGATAGCGA ATCCCCGCAA GATGTTCACC 
TCTTTTTTCC CACACTACGC GGTAnAGnAC GAATCTGGAT AaTAGAaTTC AgACGAGAAC 
GAGCCAAAAT ATAGTGCTCA TCATTCGCCT CTATTAGCAC GTACAAAATT TGTGGTCCCa 
GCGCAAgcGC TGCtTGCcGG AgCGTCCAGG CGTGGTGGTC CATGCAAGAA AGCACGTATG 
CGCAGGAAGT GATGCACTTC CCCAAGACGC TGCCGCACAA AACTCCCGTG CAGCAGGACT 
ACCAGGAACA ACAGATGTAC ACTCAAAACG CACAGTAATG GCAGGATCAG ACACATCCTG 
ATATCCACCT AAATTCAATT CGTGATTAGA AAAGcTGTCG CACACCGTGG ACAGTACGGG 
AGTATTTTAT AACCTTCATA CAGCAGTTTT CGCTGCCATA GTGCGCCACA ACCCACCACA 
CGGACTCCAT GTAGCAGACA TCCATGGTTT TATAGTCATT ATCGAAGTCA ACCCAGCGCC 
CAAGACGCGT GAGTGTGCGC TGCCACTCCT TCACATATCG CAGCACACTG GAGCGACATG 
CCGCGTTAAA CGCGCTGACA CCATACGACT CAACATCACT TTTTGAATTC AAATTGAGCT 
CTTGCTCAAT CAGGTGTTCA ATGGGCAGAC CATGACAATC CCATCCAAAG CGACGCGGCA 
CGTACGCACC ACGCATTGTC TGATAGCGCG GAATAATATC CTTAATCGTG CTGGGCACAA 
AGTGACCAAA ATGTGGCAGT CCAGTTGCAA AAGGGGGACC GTCAAAGAAA ACATAAGACT 
TCCCCTGCGC ACGCTGCGCC AC AGACTGC T CAAACACCCG GCGTTCCCGC CAAAAGGCGA 
GAATACGCCG CTCCTGCGCG ACAAAATCAA CCTTTGGGTC CACAGGCGTA TACATACAAC 
CTCCGTTGCT CAGAATCGCA TAAGGAGCGT AAGGCATTAT ATCATTTTCG TCCTTCCTTT 
TCCCCATACG TCTTATGACC GGCGCCACAC CTTTCCCCAC CTGCACCAGA TACCCCACGT 
GTGCGTAATC GCACGTGCTC - TGCCAATTAC TGCATTGAGT ATTACGTATG CAATAATGCC 
C C AC ATT ACA CCTTCTGCAA TCGAATACGA AAAAGGCATC ATCAGAACTG CGACGAAGGC 
AGGAAACCCT TCCCCACATC TTGAAAATCC ACATTGCTTT CCATGCAGCT CGTTACCGTA 
ACAGTGCAGG GAGCGATGGT AGCCGACATT GCAGTTGCAG TGAGAACCGC ACCCCATCCA 
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TTCCTGTATG CGAGCGTATT GCAGGATTCG CCGCAAGCAC GTAAGCCAGT GCGAGAAACG 
AGGTATCAAA AAATGAGCGC ATTCTTATGC ACATGATTCA AAATGACTAA ACCTCATAGT 
AAATGAGGCG CCACCACGGG ATGCAGAACG CACATCGGTA GAAAAACCAA ACATTTTCTT 
CATCGGAGCC TGTGCATGCA CAAGCTCCCG TCCGTGTTTT GAATCCATGC CCAGAATTAT 
TCCCCCCCGC TGTATGATCA CATTCATCAC ATCTCCTAC 
(2) INFORMATION FOR SEQ ID NO: 132: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 722 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 132: 
GTTCCGTCAG CAATTTCAGT ACGCGGTTGA GGTATTGGGC GAAAAGGTTC TCTCGAAGCA 
GGAGACCGAA GACAGCAGGG GAAGAAAAAA GTGGGAGTAC GAGACTGACC CAAGCGTTAC 
TAAGATGGTG CGTGCCTCTG CGTCATTTCA GGATTTGGGA GAGGACGGGG AGATTAAGTT 
TGAAGCAGTC GAGGGTGCAG TAGCGTTGGC GGATCGCGCG AGTTCCTTCA TGGTTGACAG 
CGAGGAATAC AAGATTACGA ACGTAAAGGT TCACGGTATG AAGTTTGTCC - CAGTTGCGGT 
TCCTCATGAA TTAAAAGGGA TTGCAAAGGA GAAGTTTCAC TTCGTGGAAG ACTCCCGCGT 
TACGGAGAAT ACCAACGGCC TTAAGACAAT GCTCACTGAG GATAGTTTTT CTGCACGTAA 
GGTAAGCAGC ATGGAGAGCC CGCACGACCT TGTGGTAGAC ACGGTGGGTA CCGGTTACCA 
CAGCCGTTTT GGTTCGGACG CAGAGGCTTC TGTGAtGCTG AAAAGGGCTG ATGGCTCTGA 
GCTGTCGCAC CGTGAGTTCA TCGACTATGT GATGAACTTc AACACGGTCC GCTACGACTA 
CTACGGTGAT GACGCGAGCT ACACCAATCT GATGGCGAGT TATGGcACCA AGCACTCTGC 
TGACTCCtGG TGGAAGACAG GAAGAGTGCC CCgCATTTCG TGTGGTATCA ACTATGGGTT 
CG 

(2) INFORMATION FOR SEQ ID NO: 133: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2308 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 133: 
CGACGAACAC TCCAACCCCA AATGACACCT TCTCCGGAAG ATTACCCAAC TCAACCAAAT 
TCCGACAACC TTCAAACAGA CCATCTGCAA GATTCTTCAC TCCAACTGGA AGACGAACCG 
ACGCTAATTC CGCGCAGTCT CGAAAGGCCC CAGATCCAAT TTCCTTCACG GAATTAGGTA 
AAACCACGTC TACAAGGCTT CCACACGAAG AAAACACATC CTGCCCAATC ACCTGCAGCG 
ATCTCGGAAA CACAATTCCT GCCAGACTCA CACACTTGTG AAATGCATGA TGACGAATCT 
CTCGCAAGCC GTCCGGAAGT TCAATACGAG CGAGTTTTGT ACACCCATGC AAATGGCATG 
GCTGGCCCAA TnCATGGCGC ACCGAAACTA GGCAACGnAC ACCCGACACC AGCGGAAACG 
GCACCCATAA AACGCCTTGG TAGCCACCTC GGACAACCGG CAGCCCATCA ATCTGACTGG 
GCAGTACCAC TGCAATGCCC TTGCCCAGAT ACCGCGTGaT AATTATCCCA TCACGGTCGG 
GAGTCAGACC GTACTGaAAA AACTTCGCCG GCGTCTCTTT CACCTCCgCA CGAGAGACAG 
CATTCTGCTT CGCAgAGACA GGATGCGATA CCAGTACGCC CAAACCGATA CACCACATAG 
CAAAAACGAC T AC TTGTTTT GGCATTATTT CGTCCTCCTT TTTAAATCAA CACGAGCCAA 
AAACTAACAC TCACTTCAAT GGATACGCGT CCCACACAAG CGGTGCGCAA sATGgACGAA 
CCCCCTTGAA AAGGTCAACT TTTAAAAAAT TAAGCACTCC CGCAGATATG CGCACCGTCT 
CCTACAGGGA CTCCATACCT CGCCGGTAAA ACTAAGACCG TGAGCCCTCA CCAACGCAAG 
ACAGTTGCAC AGCAGTATGT ACAACTGCGC GCAGACCCGC TCGATAGCAG GTAACTCAAG 
TGCGCTGAAA cCACTCGAAG ATGCTGCTCA TCAATTGCCA CAGTTCCTAT TCCGCACTGT 
TCGAGCAGAT ATTGGCGCAG TGTTTCTGCA GAAAAACCTA TGCAACGGAT ACAGGTAAAA 
TAACCAGAAT TACACGGCAA AAACTCAACA CGCAGCGGTA CCGCAGCCCC ACACTCCGTT 
GTGCTCCAGG TAAGATCACG TACCACGCGC TGCACCTCTC GGTAGCGTGC ACACATCAAC 
TGAAAAAATT GGTGTCTCTC ACGTGCCGTG GCTGAACCCA ACCCAGCAGG CTCATCCTCT 
TCAGCCAACA GACGCAGCGC AAGGTTCTGC GTAAGGTGGC AGTACATGAA AGTGATGCAC 
GGATCATCCC CATGACTTTT TTTTCAAAAG CCTGGTACTG GGAAACACCA' AGAGCGAGTC 
CTGCACAGCT TAGAAAACCC ACGCGTAATC CCCATGCGTA CTCTTCCTTC GTTAACCCAT 
CTATCTTTAA CGCGCAGATA TTCTTGTGCG CTTGCGcAAA GcGGGCAAAA AAAGAGCCAC 
GCATCAAAGA CGCCTCATAC TCGAACCCGC TATACGCATC GTCACAAATC ACCAGTACCG 
CACACCCTGc GTCAgaTTAA GCATACACCA CCTCGTATAA TTGCTGTGCC TCCTCTTCCG 
TGGGGGTATA AC tGACGGAT TATGGGGGAA ATTCAAAATT AACCTTATCA CCCCATCCGA 
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AGC C TGTGCG TCCAGCGCTT CCTTGCACGC GCTCAAATCA AAGCGCCCCG CTCGAAAAAG 
AGAAAAGGGA ACCGGcGTTG CCGCACAACG CACTGCTAGC ATGAGATCAT AATTTTCCCA 
GCGCGGTGCC GGCACCAAAA CCGTCTGTCC TGCACCCACA AATAAGTCCA TAGCACAGGA 
CAACGCTGCC GTAAGTCCCG GCAACACGAT AGGAAGCGAC ATCTTTGGAA AAACTGCCGC 
ACAAGAATGC TCTCCCTGCT CTGCTGCACT CTGCATTGCT GCAACGTCCG GTTCATCCGG 
ACACAACACA GGATCACGCG CACACAAACG cCGCGCCCaG cGCTCGCGGA GCgCAGGAAT 
ACCTGCAGTC GGCGCGTAGG AAACTATTTC AGAAGAAGAA AGATCAGGAA CAAGCGCATG 
CAACGTATCA CGAAGCACCG GCACCCCATG ACGCAGAACC ATGCCAACCG CCCCATTCAT 
ATCAGGCGCA GCATCCGCGC CTCTGCATTC TGTGCAACAA TCCCGTGGGG AAAATACGCG 
CGCAAACCAA GAGGAGATAA CAGCGCGTGC ACCACCGTTC CTTCAAGAGC AGCGTTTAAC 
GCACGCGCGC CTTCAGAGAG GTCCATGT 
(2) INFORMATION FOR SEQ ID NO: 134: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 123 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 134: 
CTTTGACCAC TCGGGAAACT CCCCAAACGA ACAAGCCAAC TTAGGGATTC GAACCCTAGA 
CCTCGAGATT ACAAATCACG CGCTCTAGAC CGACTGAGCT AAGTTGGCGA TGACTACGCG 
CCCCGGACTG TACGCGCGAA CCCACGGCTT TGTCAAGCCG CTCTCACTGC ACTAAGAAAA 
AC T AC ACGC G CAGAACAGCA GAGCACGTAC AATGCGGACT CTATACCTTG AATTTCTCCA 
CCTCATCTGC AAGACTGCTG ATGCTCTGCt TGCTACGCTG TGTTATCTCA TTAACCTCGT 
GTAtGCGTTA TTGATCTCGA TGGCACCAGC TGCCATCTCA TTCATACTTC TGGCAATTTC 
ACTAGTTAAG TCGTCCAAGC GCTGCATCTC GCGTGCAATA ACCTCGCCCC CTTTGAGCAT 
ATCCGCAGAC CCCTCCTTCA CGTCTAcGGT GGCGGCATTG ATGCTCTTAA TCGCAGCTAG 
GACtCACGGC TTCCATCCGA CTGCTCTTTC ATCGCCTCTG TCAGCGACCG GCTCATTGTA 
CGCACCTGAT CGGACAAACG GAAGaTGGTA TCAAACTGCT CCTCAACCGC TTTTGAAGAC 
GTGGAAAGCG TATCTATTTC CAC AC TGAGC GTCTTGAGCG TCTCAGTAAT GGTCTTTCCT 
TGGGTGCTAG ACTCTTCCGC AAGCTTACGG ATCTCATCCG CAACCACCGC AAAACCCTTT 
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CCTGCCTCGC CCGCATGCGC CGCCTCGATA GCGGCGTTCA TTGCAAGCAG GTTCGTCTGA 
CTTGCAATGT GCTGAATAAC ACTGCTCGCC TCAAGCAAGC TTCCCGATTC CTsrcTGaTC 
TTTTGCGTAA TACCGCTAGa GCTAACCAGC GTGTCACGCC CATCAGCGGT GGCAATAGCA 
AGACTGTGCA CCGCCTCATC ACTCCGTTCA AGTGTCTGCG TAATAGACAC AATGTTGGCA 
ACCATTtGCT CAACTGAGGA GGAAGACTGT GCGACGTTCA CCGCCTGCGT CTCAATGCTA 
CTGTTCAGCC CCTTAATTGT CTTGATGATA CTTCCACCGT GTCCGTGGCC TCAnACACCC 
CACTCACCTG CAATTCAACT CGGTGCTTGA CACCATCGAT ATTGGCGGTA ATCTCGTTTA 
CCGCACTGGC TGTTTCAGTC ATGTTACTGG CAAACTCGTC CCCGATGCGC CGCATATCGT 
CAGAGCTCGA CCCCACCGTG GCGATACAAA GCGAAT 
(2) INFORMATION FOR SEQ ID NO: 135: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3856 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 135: 
CCTTGGTGTT GATTTTTTTA TATGTGGCGC nTCGTTTTCG GTGGTTCTTT GCGCTTGGAG 
CGATAGTCGC GCTCGTGCAT GATGCGTGCA TTATGGTGTC ATTCATGGTG TGGTTTGGTT 
TGGAGTTTAA TCTGCGAGTA TCGCGGCGAT TCTGACGATT ATCGGGTACT CGATCAACGA 
CACGGTGGTA GTTTTTGATC GGGTGAGACA GACCATTCTC CTGGATCCTA TCGCGTCAGT 
GACGACAGTA CTTGACCGAT CGCAAACAGA CATGCTCACG CGCACTGTAG TGACAACGGt 
GACGACGCTG CTTGCAGcGC TGATGTTGTA TGTGTTTACC GAGGGAGGCA GTCGGGATTT 
CTCACTCGCG CTAATGGTGG GGATGGTCAG CGGCGTGTAC TCGACCATTT ACATCGCCGG 
TGGCTGTATC GCCCTGATCA GCCGGGGGAA GAGCGGAGGT CAGCTGCTCG GACTCTGAAA 
CCCCGCACGC GCGGCTGTGT GTAGGAAAGG AAAGGAGAGA GGGAAGAGAG GGGACTCGgC 
GCTGCCGACT TTGCTATTTC TTCCTTACAA TCGTTTTTCT TTCCCCGCGG GTGTGcAGGC 
TGCGCGCTTG CCTAAGGTGG AGCCGTTCTT TTTTGTTCTT TTCCAGAAAG TCAAGTGACC 
TGTTAAAACC TTGCAGAAAT TTCTGCATCT CTTCTTCAAA CACCACAACC GCGTGCCGGT 
CAAACCCGGC ACCGTCGACG TTTTTGCTTT CTACTACCTG CAGAAATACA TCTCCGACCC 
TATTTTCCTT CACATTGAAA AAGTAGGTCC GATTTTCCGC TTGCACCTCT GTAGTAAACA 
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GCTCACCGCG TATCCCCATG CTGTACCTTC CTTCTGCTTA TGAAGTCGTA TCCGCACACG 
AAAGCGCACA GCGATCTGCC TCGCGCACCA TGTCCGCGTG CAGCTGCAAA AGCGCAGyTy 
CCCCCGCCTC GTCATTACCT TTCAGATCCG CAGAAATTCT AAATATTGGT TCTGTCTTTG 
AACCGCGCAT CCACACGAAC GCCACCGCTT CTTGGTGCTC GTTGTAAAAT TGGATTTTCA 
ATCCTCCGTC AGCCGACCGG CTAAAATCCC CCGCCGCGTC CTGCTGCTCC CTCCCGCGGT 
ACAATATCGG GCGGTACGAA CAAATGCCGA AGCGTTTTTT TAACGTGTCT TTCTCCTGCG 
CCCACCTGCG TTCAAATACC CGCTGaTATG CGCTCTTGAG GAGTGCATGG TCAGTCGTTC 
GTATCTGcAA GAGCGCGCGC GTATCGTGCG TTGGTGTAGT CGTGTACGCA GgTAGGCTGG 
CAAATAGATC CGATATCGTG AAGGCATGCT GCGCGCGTTC TGTTTGGTGT GAGTGCGCGC 
ACCACCAGGC AAAGAGTCCT GGCTGTTTCC CATCCCCCCG CAGGAGCAAG AGCTTTAAGA 
GTGCAAAAAC AGTATGCAAA GGATCACGTA CTGCCGCCGG ATGGAGAATG CTGCCCCCGT 
TTGATCCTTC CCCTAGAATG CGCACGCAgT AGCCTTCTTT CCGTAGAAGG TGTGCCTTCT 
CAATGAGGTG CGCCTCTCCT ACCTCCGTGC GAAAAACGTG CACGTCTAAG AgCTGCGCGA 
TAGCTTCTAC ACGCAGCGAG GTGGGACCAT TGGTAACCAG CGCGATAGGC GGCGCGCGGT 
GTGGCTGCAT ACGGAGGTTG CGACTGAGTT CACAGATCTC CGACACAACC GAAAGAGCAA 
AGACTGCCTG TTCGTGCGGA ATGACAGCGC GATTAAGGGT TTGGTCGTAA TAGACAATAT 
TTCCCCGATC TCCGTCACAA TCTGGCACAA AGCCGAAGGc AATGGAGCGT TCTTCTGGGG 
ATGAACCTCG CGTGGCTGCC TCGGTCAGTG CCTGCGCGCA CGCGgTAAGA GAGGACCCTT 
CAGGAACTAT GCGATGGCGA ATATCCCCTG GCGTTTCGGC GATACTAAAA AGGGCGACAC 
CGAGTGATTC TAGGAGGCGC CTATCTATGG AAGCTGCACG CGCACTTCCA TTGAAATCAA 
TGAGGATAGA CAGGGGTGTT CCCTGTTCGC TGTAGGCGGC ACGCTGCTGG GTAAAGCGAT 
GGAAAAAGGC ACGGTGCTCA GTTTCTTTAG GGGAATTTGC GATCACTTCC CTTATGAAGA 
GATCATAGCT GTGCAGGCTT TCTTGTTTGT GTGTTTTGTT GTACAGCAGC GGTTCGAGTA 
TAGAGGCGTT GAGCGTGGCA CAATGCTCTC GTATTTTCTG GAAAGAGGCG GGCTGTAAAC 
ACTGTGCGAC GAAgTCTTCG CTCAGCTGTT GTGCCTGAGT AGGACTGAGC ACGCCGCCGT 
CATTTAGGCC AAATTTAAAG CCATTGTATT CGATAGGATT GTGACTTGCG GAGATGTAGA 
GGAAAGCGTC GTAGTTCCTC GCGTAGCTTG CAATTTCGGG TATTGGTCCG ACTCCGACAC 
AACGCAGCGA ACAGCCTTCC AGATGGAGGA TAGCGGTGCA GATAGAACTG ATAATCTCTC 
CAkTAGGGCG CGAGTCGCGC GCGATGACAA CACGTGGTTT TGGGACCTTT TTTTTCAAGA 
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ATCGGGCGTA GCTGAGTGCT ACCTGTGCGC TGAGTACAGC GTCAGCCTTT TGCGTGAGGG 2640 

TGACGGTGCG CTCAGGGGTA AAAGCGTAGG GGAAAACCTT GCGCCACCCT GAGGGTGACC 2700 

GGGTGAGTGA TAGATGAAGC GCTGCGCaGG CTGCGGCAAG GTGAGGGAGG TGATGAGTGA 27 60 

GTTGAGCAGG GACATGCGCG GAGGATAGGG CGCGGGTGAA GGAAGTGCAA GGGAGGTGGT 2820 

GGTCTTGACA GGATGAAATC AGGTGTGGTC GGGTGTAGGG GCGGGGGAGA GGTAGGAATG 2880 

GGAGTAGTGG AATGGGTAGG TGAGTGGATG CACGCGGTGG TGTGGAGCTT TCCGATGGTG 2940 

GTGCTGTTGC TGGGGACGGG GTGCTACGTG ACGGTGTGTA TGAAGTTTTT TCCTGTGGTG 3000 

CGGCTGTGGT ATGTATTAAG ACAAACTATT GGGGGTCGCG GAGGTAAGAA GGGCGGCAGT 3060 

GGTGAGGTGA GTGCGTTCCG TGCGGTGTCC CACTgCtTGC AGCGACGTTG GGGTCTGGAA 3120 

ATATTGTAGG GGTTGCGACG GCGATTGCGA TTGGGGGGCC TGGGGCGATA TTTTGGATAT 3180 

GGGTGACGGG GATATTCGGG ATGGGAACGA AtTCGtGGAA GTGGTGTTGG CGGTGTACTA 3240 

TCGGCGTCAG ACTGGTGATG GGCGTTTTGT GGGGGGGCCG ATGTATTATC TGAAAGACGG 3300 

AGTGGG AG TT CCAGGGGCTG GGGTACTTGC GAGTTTGTTC TGCATATTCA GTGTTATCGC 33 60 

GTCCTTTGGG ATAGGAAATA TGACGCAkCG AAcTCGGTGG CTCTAGTGTT CGAAGATGTG 3420 

TTTTGTGTGG ACGTGCGGGT GACCGGGGCA GTGCTGATGG TCTTGGTAGG CTTAGTGAGC 3480 

GTGGGTGGGT TAAAAAGTAT CAGTTGGGTG ACTGGGGTAA TGGTGCCTGs GATGGCGATT 3540 

TTGTATGTAT GTGTGGGCGT ATGCTGgTGT GTTGCAATAC GCGACAgCTG GTGCCAGTGT 3600 

GCTGGGATAT CGTGTCCGGG GCGTTTGCCG GGACTGCAGC AGTTGGGGGG TTTGCAGGGA 3660 

GTGTGGTGCG TCAAGCGATA GCGGnTAGGT ATTAGC CGGG GGGTAGCGGT GAACGAGGCA 3720 

GGGCTTGGGA CTGCTCCTAT TGCGCATGCG GCGGCTATTA CAGACCATCC AGTGnCGACA 3780 

GGGGCTTGTG GGGTATCTTT GAAGTTATTT GTGGGGACAA TGGTGGTATC TTCGGTGACG 3840 

GCATTTGCGA TACTGC 3856 
<2) INFORMATION FOR SEQ ID NO: 136: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4444 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 136: 
CGACCTAGAG CGTAnACTGC TGGAGGGACC GCTCCGTCCA TTATACGGAC GATGGAGGGA 
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GGAAGCGCTG 


CCGAACACCA 


CCTTTAGAGT 


180 


GCAATTGCAG 


AATGGGCACG 


AGATCCTTGC 


GTATCTTTCA 


GGTCGGATGC 


GCAACACTAC 


240 


ATCCGTATCG 


TGCCCGGAGA 


CTCGGTGAAG 


GTCGCGCTCT 


CACCCTACGA 


CCTCTCCCGC 


300 


GGCAGAATTA 


TGTTTCGTGA 


GCGTTAGATT 


CCTCTCTCGC 


AGgAGAGAGG 


TGCTTGCTCC 


360 


TCTTTAGCAG 


CGGGGTTGCT 


CATCCTTCAA 


GGAGGATTGA 


GAGTCCTGCC 


CTCAGGGTGT 


420 


GCACTCCGCG 


CGCCAGGCAG 


GACACGCTGA 


ATGCGATCCC 


GAAGAAGCCA 


AATACTAAAA 


. 480 


AGAAGCGTAC 


CCGaATAGAG 


CCAGGAGGAC 


GAGCGCCGCT 


CCCTGAATGA 


GAAGGCGCGC 


540 


CCCGTCTTTC 


CTGGAGCCCG 


TGCGCGCGGT 


AGGACCCGCC 


GGATCGGGCG 


GAGGTCTGCA 


600 


CTGCTCCAAT AACTCCACCC ATTTCCAgGA AAACCACCgA AGAAAGTGTA CCATCCCTCG 


660 


AAGTGTTCGG 


TTTCCCGCGC 


GTGCCCTCTC 


CAGGGACCGT 


GCGTGTGTGT 


GTAGCGGGTG 


720 


CGCGCTCCTG 


CCTTGTCGTC 


CGTACC TGGA 


GGTCCAAGCG 


CCAAAGCCCT 


CTGaGGCGTc 


780 


GCGTcCTGcA 


CGcGaGGaCC 


AAAATGcGcC 


ACCTGcGcGT 


TcGGcATcGT 


aGCGTGCGCG 


840 


TGATGCACGG 


TCAGAGAGGA 


CGGCGTAAcT 


GcGTTGaTCC 


TTTTAAACTG 


GTCCTCCGcA 


900 


CACGCGTCGC 


CCGGATTTTT 


ATCCGGATGA 


TACTTAAGCG 


CCTGCGCGCG 


AAAAGCTTTC 


960 


TTAATGTGTT 


CCTCAGAAGC 


ATCGGCAGCA 


ACGCCCAGTA 


TGGCGTAATG 


GTCAGGAACA 


1020 


GTGCGCTCGC 


TCATCGTTTA 


CCCAATGGTG 


GTCCCCACGA 


ACTCTTGCCT 


GTTCAGGATG 


1080 


CGCTCCAAAT 


TCGGTAAGTC 


ACGCCCTGTT 


GCGCAAATGA 


CCTGAATTCC 


CGACTTTTTT 


1140 


GCCCGGACAC 


TAGCGACTGG 


GTCAAAGGGG 


ACATGGCTGC 


CCGGTACCCA 


CTCCTTGTCG 


1200 


ACAAGGAGGA 


GGAAATCATC 


CCAGGAGAGG 


GATGTGAGCG 


CCTTCGCATC 


CTTGTCTGAA 


1260 


CGAGGATCCC 


CCGTGTACAC 


ATGCGCAACG 


TCGGAAAGGT 


TAATAACCGT 


CTTTGCAGAA 


1320 


TAACGCTCTG 


CAAGGAGCAC 


AGCGTCGGTG 


TCGGTGGAAA 


AACCCGGTTT 


CCAACCAGCA 


1380 


GCAACGAGCA 


CCTGACCTGA 


AAAAACGTTC 


GCCGCAGTCG 


GGTCATACAC 


GACCGGATTT 


1440 


GGACAAAGGA 


TTCCAAAAAG 


GGATTTGAGC 


AGCTGCGCGT 


TCAAACGCGT AgCCATGaTG 


1500 


CCAATCCAGT 


CAAGTTCAAC 


GTGTTCGGCC 


GTGGCATACA 


GCTCCCTGTC 


TTCCTGATTA 


1560 


TCGCCCTCGT 


GCGCAGGTTT 


CGCACAGGCC 


GGAGAAGACA 


GGCTCCGGCG 


TAgcGCAcGA 


1620 


TAGGCGTTTT 


GATAAGTGCG 


CGCAGGTGCA 


CCACCGCCTG 


AAACGACAAT 


GAGCTTCCGT 


1680 


GAGGCGTCTT 


CGTATAGGTA 


CCGTTGAACG 


GAACGAACGA 


ACCGCCCGAG 


AAGCTCTATG 


1740 


TCGGGCGTCT 


CAGGCGCAAC 


GATGGAACCT 


CCAAGTGACA 


GAACGGTGAC 


CATGAAACCC 


1800 
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TCTCGCCGGC ATCGTAACGC AAAGAGACCC TTTGGATCCA GGCCCTGTGT GTATCTGGCA 
TTGCGTCCCA GCGTGCACGG GGCGATGGAG TGTTCTACAC GGGCGACACA GACTCCTAGT 
TCTTGTATTC TGTGCAAAAA CCGACGTAAA AACTGTATCT CCGTAGTCTA GTGAGTGTTC 
TCTGTGCATG AGTTGCTCCC GTACGACCGG TGCTTTACGC GCGGTCCCCC TTGTGTTCCG 
TTCCGTCCTG GTGCTTGCGG TGTGGGGTGT TTCCTGCGTA CAAGCCGCCG ATGTGGCGCA 
CAATGCGGAT GTACCTTCCC GCTCGCTGAA GGCGCTCGAG CGTTTCCGTT TTTTTGTGTA 
TCCCAAGCCG CTCGACCTTT CTAGTGACTT TCATGCGAAG GCCTTGAAGG GGGAGGCACT 
GGTTCCTAGC CTTTTCAAGG GAAAGGTGAC GCTTTTGAAC TTTTGGGCTA CGTGGTGTCC 
GCCCTGTCGT GCGGAGATGC CGTCTATGGA TCGCATGCAG GCTCTTATGA GGGGGAATGA 
CTTTCAGATT GTCGCGGTCA ACGTTGGTGA CTCGAGAAAA CAGGTGGAAA GTTTTATCGC 
GCGTGGAAAG CATACCTTTC CTATCTATCT TGACGAGGAG GGGAGTTTGG GGAGTGTTTT 
TGCTTCCCGT GGTCTGCCAA CTACTTATGT TGTGGACAAG GCAGGGCGCA TCGTGGCAGT 
GGTTGTCGGG AGTGTGGAGT ATGACCAACC AGAGCTAGTG GCTCTCTTTA AGGAACTGGC 
GCGTGACTAG TGTCCCCGGC GTTGTGGGTT CCTTTTTGGC CGGGTTGCTT TCTTTTCTCA 
GTCCCTGCGT CCTGCCGCTT ATTCCGGCGT ACGTCTCTTT CATTTCGGGA GAATCGCTCG 
GTTC TATCCG GGCGGGGGCG GCGCGGCTCC AGGTTTTTCT CAGCAGTGTT TTTTTTGTAT 
TAGGACTGAC GACGGTTTTT GTGTTGTTTT CAATCGTATT TAGCGGAGGG GTGCAGCTTG 
CAGGTGCGGG TGTGCTCACT GTGCTCACGC GTGTAGCGGG CGTGGGGGTG ATACTCCTCG 
GCTTAAACAC AATCTTCGAC GTGGTTCCGT TTTTGCGTGT GGAAAGGCGT ATGCACACAA 
CGGTGCGACG GGTGGGTGTG TTTCGTGCGT ATCTTTTTGG GTTGCTGTTC GCAACGGGAT 
GGACTCCGTG CGTGGGGCCG ATTCTCTCTT CTCTGTTGTT CTATGCGGCG AGTTCTGGGC 
AGCTGCTCCA CGCAGCAGGG CTCCTGACCG TGTATGCACT GGGATTGGGA CTTCCCTTCG 
TGTTTGCAGG GATCTTTTTT GGACGTGCGG AGCGGGTGTT TGCGTGGGTA AAGAGTCACA 
TGCACGCAGT AAAGCTCGCC TCCGGGATGT TGATCGTCTT TTTCGGACTG CTGATGCTAA 
CGTCGGGGTT GCAGGCACTC AGTCGGCTTT TTCTACGGGC AGGATTCGCG TTAGAGGAAT 
ACTCGACGCG GGGAATAACC CCTCTTCGGC AAATAGCGGC ACTTCTTGCG CAtGGTTTTT 
GTACCAGGGG GTTTGAGCGC GAGcGGCtTT GGGGCTGTGC GGGTGGGTAG CCATCACGTA 
AATAGTTTTT TGATGCGTGT GAAGGCCCGC GTGACCTCTT CCTCGCTTAT TTTTCCTTCC 
GGTGCATGGG CGCGCACCAG GTGTTTGGGA ATTTCGTCCA GAAAGGGGGA GGCGGTGCGC 



1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 
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3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 



Printed from Mimosa 02/03/22 07:30:04 Page: 772 



WO 98/59034 




TCTTCCCTTT TTCCGCCGCG TTTCCTTGTG CGGCAGTGTG TCAGGTACAA CTTCTCTTTT 3600 

GCGCGGGTAA .TTGCAACGTA GAAAAGGCGT CGTTCTTCCT CGATGCTGTG AACCTCTTCT 3660 

ATACTTCGTT CGTGTGGAAT GGTTCCCGCT TCCACGCCGG CGATAAACAC TACGGGAAAT 3720 

TCCAGCCCCT TCGACGCATG AATTGTCATG AGCGACACAG CGCCCTCTGT TTCTTTTTGG 3780 

ACGTTATCGC GTGCTAGCAA CGTAACGCGG TTTAGGTAGT CGTACAAGCT TCCGTGTTCT 3840 

GAACTCTGTT CCCAGTGTTC GATGGATTCG ACTAAGTGTT CGATTTGCAA GAATTTAAAA 3900 

CGTGCGGCAT GTTCGTTTTT TTGGAATTCT TGGATGAGGT AATTAAAATA CTGAATGTCT 3960 

TCAACAAATT TGCGTACCTT GTACGCAAGA TTTTTCCCCG AAAGTAGATG GGTACGCGCC 4020 

TGGGTAATGA GC TGGAG AAA ATTTTCCACA GCGGTACGAT GTGATTCTTT T AAATCG AC T 4080 

GCATGTGTTT TATTGATAAT TTGATTCAGT GCATTGAACA CGGAACATTG TTGTGTATTA 4140 

GCAATGTCGG AGACTAGATG CAGTGTTTTT TTTCCAATTC CTCGCCTCgG GGTATTAATG 4200 

ATGCGTAATA GGTTGACATC GTCGTCAGG A TTAGAAATTA CCCGCAGATA ACTGAGCACA 42 60 

TCCTTTATTT CTTTCCTCTG AAAAAAGCTC ATGCCGCCTG AGACACGGTA TGGAATATTT 4320 

TCTTGCAAAA ATACGTCTTC AATTATGCGC ATAAAACTAT TCGTGCGGAG TAGGACTCCA 4380 

AAACTACTGA AAGAATAGGA TGCGCGTATT TGTTCTGCGA GAATCGTGTT TGCAATAAAG 4440 

ATTG 4444 



(2) INFORMATION FOR SEQ ID NO: 137: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5695 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 137: 

GGACGCAAAC GGCGTTCnCC TTCAAAGGCG CGCTCGATAC GCCGCGTTCA ACCCCATAGC 60 

GTTGCCGGTT GCCGCTTCCn AGACTTcCCC CttACCGTAC AGTgCACGCG CGCGCGAACy 120 

TTCCGCAAaG CAGACAGCGA TACTGTCCCA GACGCAGGCT TCCACCCGTT GGGgTATyTA 180 

CACAACCGGG CAAAAGATCA ACAACCGGGT GAGGGGTGTC TACCGCAAAT TCTCGAGAGT 240 

GTGCACTTGC CAGCAACAGT ACATTGCGCG CAAATTCAAT GACTGCAATC TGCATCCCCA 300 

AGCAAATGCC AAGATAGGGC AGGTTCTGCA CACGCGCGTG CGAAACTGCA CAGATCATCC 360 

CTTCAATCCC CCGCACGCCA AAGCCACCTG GAATTACCAG CGCATCTGCG TCCGCAAGAG 420 
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CGTGCCCAGC GTCCTGCACG CTGCAGATTT CCTCTGCGTC TATCCAGCGC ATGTCCACTC 4 80 

GCGCACGATG GCAAATACCC GCCGCCGTcA ACGCTTCGCT CACGCTCAAA TACGCGTCCG 540 

CAAGCGACAC ATACTTACCC ACCAGCGCCA CCGTAAGCTC CCGCCGCGGA TAGTACAGTG 600 

CACGGACCAT TGCGCGCCAC GCCGTAAGAT CTGGCTCTGC CCCCTGCCCT GCTGCAGGAT 660 

GCAGACCACC GTCTGTACGG ACCGTCCCCC CCGCTCCCAA CACAGCCGAC TGCGCGCCTC 720 

CTGCACCGAG ATTACGCGCC AcCTGCCCCC CGCAGCAGGT ATCAAAAAGA CGCAAACGCT 780 

CACACAAGAG CGCTCCCAGC CCTTCTGCTT CCAAAAGGAG CGGCACCTCA TAGATAGAAC 840 

GAGCCGTCAC GTTCTCGACA ATAGCACGCC GCTCAACATT GCAAAAAAGG CTCAGCTTTT 900 

CGCGCACCGC ATCCGTGATG TGACGCTCGC TGCGGCACAG GATGACATCC GGCTGCACAC 960 

CAAGTCCTAG CAGCTCCTTT ACGCTGTGCT GTAGCGGCTT GGTTTTCATT TCACCACAAC 1020 

TGGGTAAGTA GGGAACTAAA CCCAAATGAA TAAAAAGACA GCGCTCCTTC CCCAGAACAC 1080 

GCCTGATTTG ACGAATCGCC TCGATGAACG GAAGCGACTC TATGTCACCG ACGGTACCAC 1140 

CGATTTCGGT GATAACCACC CGAGCCCCCG TGGTAGCAGC AGCGGcGCGA ATTCGCGCcT 1200 

GgATTTCATC CGTAACGTGA GGAATAACCT GTACGGTAGC ACCTCCGTAT CCTCCCGCGC 1260 

GTTCACGGTC CAAAATAGCC CGGTACACGC TCCCCGCAGT CGTGCTATTG AATCTACTTG 1320 

AAGGCACGTC CGTGAAGCGC TCGTAATGGC CCAGATCCAG GTCCGTTTCG CCGCCATCTT 1380 

GCGTGACAAA CACCTCCCCG TGCTGATAGG GATTCATGGT ACCGGGATCC CCATTCAGGT 1440 

AGGGATCAAA CTTTTGATTC ACCACGGATA TACCCCGGcT C TTAAGC AAA AGTCCGATGG 1500 

CAcTGCGGCA ATACCCTTGC CCAGCGAGGA AACTACACCG CCTGTAATAA AGATAAAAGC 1560 

CGGATCCATA CGGCACGAGT GTAGCGTGTC CATGCGTTTT TTAAAAGGGA GCACCCTGCC 1620 

TCTCCCCTGC GCCTGTAGGG TGGACTGTCC CATCGGCTTG CATCGAACAT TGCTTTCGAA 1680 

TACCATGGGC CCCATGAACG CGCGCGCTGC CCTCGCTCTT GCCATGGTCT TCACTTTCCA 1740 

GAGGTTGTGC GCAGAAGAAC GTTTCGTTAT CTCCACTGAA TACTTCGACA TCATCTACAC 1800 

CGAGGCCTCG ACTGAGTCTG CGCGTATACT GGCGAAGCAC GCTGACCGGT ACGAAGAACA 1860 

AATCAGCCTC ATGCTCAATC GGGTTCCTGA CAAGAAAAAG CGCACCACCG TCGTCCTGTA 1920 

TGCCCACACA CAGGATGCAG GCGGGTCTTT CTCTTCCAAA CCGTCAAGGA AAATAATCAT 1980 

CAACGATACG CGCGTTCCCA ATCTGGGTTT AGGCAGTTTT AAGGATTCGC TGCTGAGCAT 2040 

TTTTTACCAC GAGcTTACGC ATAAGATTTC TCTCGATTTC TTCATGCCGC TACTCCCTCC 2100 

CCTCTTTACC GAAGGGGTTG CCGTCGCCTT TGAAAGCAAC GACGGCACGC AAGGGCGACT 2160 
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GCACGACCCG CTGACAATGC ATTACTTAAT TCAGAACAAA CTGGAAAACG TCTCCCCCTC , 2220 

CTGGAGGGAG GTTGCCGAGC TCAGATACAA CTACCCCCAC GGTATGCCCT ACGTGTATGG 2280 

GGGAAAATTC ACCGAATACC TGCAGAAAAT ATACGGCAAG GAGCGGTGCG CCCGGCTGTG 2340 

GCAGAACTCC TGGCGTCTCT TTATTCGACA CCGTTTTTGG GACGTTTTTC AAAAGAATCT 2400 

GGGAACTGCG TGGAATGAGT TCATCGACAG CATTCCAATC CCGGAGAAGG TGGCACAGCC 2460 

GCAACTCCTT TCTGAACGGG AAGCGCAGGG TCACTACGGT GCACGCAGCG nTGCGCCGAn 2520 

CGGctTCGCC TATTATGACC GTGACCGTCA CGCGGTGCGT TTCCGTGACA AAGCAGGTGG 2580 

GGTGCGCACG CTGTTCTCTC ATGACAACAC GCTGCATCAT CTGAACTTCT CCGAGGACGG 2640 

ACGCTATTTG GCAGTATCAG ACACCATTGA CACGTGGAGC GAGCGCACGC ACCGAGTACG 2700 

CGTTTTTGAC ACACACTCAG GCTCGTTCTC GCCGGAGGTG TACACTGGCG CGTCCGAAGC 2760 

TTGCTTTGTG GGGAACGGGC AGAAAATAGT ATTCGTCCGG GTGCAGGGGC AATACTCGCG 2820 

CCTCACGCTC AAGGACCGCA CAGACCCTAC cTTCGAGAAG GTGTTATACG AAGCAGGTCC 2880 

TGGTTTGCCA TTCGGTGCGC TGTACGCGCC GGcGTACGCg sTGaTGGCAC CGTCGCCATT 2940 

ATTGGCGCAC GCGGTATGGA GCGTAACTTG CTTTTCATCC CGGTGGACGA CAGGCCAATG 3000 

ATGCAGGTTC CGCGCGAGCA GATGC CAC AC GCAATGCGGG AACTGCAGTC GCAAAAGATC 3060 

aAAGGCTCGT GGACGCTCAC CTTTAGCTGG GCAAATATGA ACATGCTCTC TCGCCTAGGT 3120 

TTTTACGACG TGTCCCGCCA TACCTTCAGA CTAATGGACC AGGACGTGTC AGGGGGGGTG 3180 

TTTGCACCGG TGGTGTACGA GGCACTGCCT GCTGCGGTGC ACGAAGAGTC TGCAGCAGAG 3240 

GCGACCATAC GCGGTGAAGA GCCGGTTGTG CGTGTGGCCT ACACCGGCAG ACACCGCATG 3300 

CACATGAGCC AATACCAGCG GGATGACCsC GCCCTGCGCG AGCGGCGGGT GTCGCTGGTT 3360 

CCCTTGCACC CGGCAGAGGC GGAGGAGCAG TCGCGCCCCG CCACGCTCAT GGTTAACGGA 3420 

GAGTTCATAG CAGACGTGCA CGAAGCCGAC CGCGGCAGAC GGTACCGCGC AgCTCAGTGG 3480 

ATGTGGCCTC CCACGTTCTC ACCCCGCTTT GTGCCGCCAA ACAGCTTCAG CAGCCTCAAA 3540 

GACCTTGGAC ACACCGGACT GGGGGTGAAC ATGAAGTTTG CCGATCCGTT TGGGCTCGTG 3600 

GAGGTGAATC TCCAGTCGGT TTCTCATTTC TATCCGTTTT TTACCTCGCT GGGGCTGAAA 3660 

AGCTCTTTTT ATGTGGGCAA AACCACGTag CCCTACGTGC GTATCACGAG ATAGACACCG 3720 

GAGGGTTCCG GTACACTAAG CTCGGCGGGG CCTTTGAAAC GCTGACAAAC TTTCCTATGC 3780 

AGGATGACCG AAACGCGTTT TTTGTGCGAA CTGCCGTGGG GGTAGACTCC TACTCCTGCC 3840 

TGTGCGCCAA CGGCGGGGGT AACGGGAATT GCTGCGGCAA CAACGGGGGG CAGCAGTGcT 3900 
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GCGCCTGCAA TGGACAGGGC GCAAACGGAC CACATTATTA CAAGAGCCTT GAAAGCCCCT 
TCATTCAGGC ACAGGTGGAA ATGGGGTATA GCTTTTCTCA GCGTGCAGAG CGCACGGGAA 
CAAACTGGTT CGTGGCGGAC GTCACGGGGG TGAGCCTGAA GTTACACGTT GCAAATAGTT 
TTGACACCGG TAAAACAAAA GACGCGGTGC TCGTGCAAAC GAAAGGGTCA TTCCGCCTGC 
CGGTGGTGCC CCTACGGGTG GGCGTCAgcG CGTACGTTGG GTATAACGCC GGGTGGCGCG 
GGGGCAAAGG AAACATTCTG GCGGAGCACC CAGTGTACGG CTTTCCCGGT CCTACGTATT 
TACCCAAGCT CGCAGGGGTT GGTGGTATGG AGGGCTCGTG TAAAAAAAGC AAGAGTGCAG 
GGTTCGGCGC TGAAGCAGTG CTCACCATCT TGGACTACGA CATCAGCATA TATGATCCCT 
ATCTGCCCGT CTTC TACCGA AATATTGTTT GGAACGTAAG CTGCGAGTAC GTGCTCAATG 
CGCCAGACTT TTCCTCACCC AAACACCTGT GCGTTGCAAG CACATCACTG GTTTTGGAGT 
TTGACCTGGC GGATGTAAAA GTACGGGCCG GGGTTCAGTA CGGCTTCCAA CTTGCAGAGA 
CGCAGAGCGC CACAACCCCC GGTTTCAGCC CGATTTTTTC CATGGCCGTG TAGGAAGGTT 
CCCCCGTGGC GTGTGCAAAG CGCGCTACGG GGGCAGCGCT TCGGCAAAAC GCGGGAGCTG 
GCTGTCTACG CTCTGGCAAG GGTTTTTCTA TGCATGAAAC GTCCTACAAA CCTATCCACC 
GGCTGTCCCC TGACACCGCT AAAAAAATCG CCGCAGgAGA AGTCATCGAG CGGCCCGCCT 
CCGTCGTGCG CGAATTGCTC GAGAACGCAC TCGATGCAGG CGCCACCAAA ATCCATCTGG 
AAATTAACGC AGGCGGcTGC GCGCTCATCC GCGTGAGc g A TaACGGCCAC GGCATGTCCC 
CCCAGGATTT GTTGCTATGC GCTGAAGCAC ACACCACGAG CAAAATATCG TCTGCAGACG 
ACTTATTGCA GcTGCGCACG TTAGGCTTCC GGGGAGAAGC ACTCGCCTCC ATCGCCGCAG 
TCAGCCGCCT GCACCTTACG AGCACCCGAT CAGGGCCCCT CGCGTGGCAC TACCAGCCAA 
AGGCTGCAGG CACTGCACGg CACGTACCGC CGGTGCCGCA GGgCACCGAA GCAGGCGTGC 
TAGAGCCTGC AAGTCTTGAG CGAGGCACCG TCGTACGCGT CGAGCAGCTT TTTGAAAACT 
TTCCTGCGCG CAAACGCTTT CTCGGACGCC AAAGCGCAGA GACCACCCTG TGCCGCAGCG 
CACTCATCGA CGTCTCCCTC GCACATCACC CCGTGGAGTT TCGCTTCACC GTCGACGGAA 
CGCACAAGCT CACCCTGCTC AGTCAGCAAA CCCGGAAGGA TCGGTGTCTT GAAACGCAAA 
TGCTCAAAGG AGATCCTGCG CTCTTCCACA CCATAGAAGG AGGTGACTGC TCGTTTCACT 
TTCACCTTGT ACTTTCAGAA CCCGCCATCT GCCGCAGAGA ACGCCGCGGT ATTTTTACCT 
TCGTCAACGG ACGACGCATT TTTGATTACG GTCTTGTCCA GGCACTTGTG TTAGGAAGCG 
AGGGATACTT CCCCAATGGC ACCTTTCCGG TCGCCTGCCT TTTCCTCACC GTTAACAGCG 
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AACGTATTGn ATTTTAATAT CCACCCTGGC CAAAAAGGTA GGTTCACTTn ACAGG 
(2) INFORMATION FOR SEQ ID NO: 138: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 659 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 138: 
AACTACTCTn AGTAGGATCC CCCTACACTn TATCGCACAC GGCGCACAGA CGGAACGGAG 
CGCCTGTGTT TCACACATTG AATTTAAAAA AGATAACGTC CCCGTCTTGC ACCTCGTATT 
CCTTCCCCTC CTGC CGAACG CGGTTTGCCT CCCTCACCTT TGCCACACTC CCACAGGACG 
CAAGATCATC GAAAGAATAC GTTTCTGCAC GAATAAAACC ACGCTCAAGG TCGCTGTGGA 
TCACTCCTGC CGCGTGCGGT GCAnAGnCCC TGCCCGAATG GTCCACGCGC GACACTCCTC 
AGGCCCCGCG GTAAAAAAGG TACGCAACCC CATCAGGGAA TACACTGCGC GCGCAAnGCG 
CACGTCCTGA TTCGCGCAAC CCTAATTCTT GCAAAAAGGC GTTTTGCTCT GCCACATCAG 
AAAGCTGCGC AAGTCTGCTT CAAATTTTCC ACACATAACA ATTGCCTGCG TGTTATGCAC 
ACGTGCGTGC TCTTGCACCG CGCGCACGAA ATCATTTCCG TAnTGCATGC CGCTTTCGTC 
TGTATTGCAC ACGTAnAGGT GCGGCTTCAT TGTCAACAAG CGCATATCGC GCACCGGTTG 
CGCTCCTCAT CCGACAGCGG CGCCATAnAT GnCGCTTTCC CATTTCTAAA TATTCGCGC 
(2) INFORMATION FOR SEQ ID NO: 139: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1229 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 139: 
AGnTTCCTCC GCCTCCTTGC gCCGGGCTtC CTCCGCCTCC TTGCGCCGGG CTTCCTCCgC 
CTCCTTGCGC CGGGCTTCCT CCGCCTCCTT GCGCCGGGCT TCCTCCGCCT CCTTGCGCCG 
GGCTTCCTCC GCCTCCTTGC GCCGGGCTTC CTCCGCCTCC TTGCGCCGGG CTTCCTCCGC 
CTCCTTGCGC CGGGCTTCCT CCGCCTCCTT GCGCCGGGCT TCCTCCGCCT CCTTGCGCCG 
GGCTTCCTCC GCCyCCTTGC GCTGCAGCCA CTGCTCAAGC GCCGCTATCA TACGCTGCAT 



5695 
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TCGTAGTGTT GTGAAGCAGT CGGGTTTTGG GACACGTACT TTTTATAGTC AGCCAGTGCG 360 

TGTGCATAGG CTTGTCGCTT CATGGCAGTG TTTGCGCGAT TGAGCAAGGC AGGAGGGTAA 420 

TCGTGCTTGA GGGCGAGGGC CTGCTCATAC GCATGTTGGG CTTCCTCGTA GCGATGTTGC 480 

ACGAAGTATA CGTTCCCGAG ATTGTAGGCG TATAGGTGCG CATATTCCTG ACTGTGTACT 540 

GGTGGGTTTT GCAGCCACTG GATTGCcTGC GTGTAGCGTC CTGTCTGTAG GTACGCCATT 600 

CCCAAGTAGA GTGCTGCTTT CGGGTGGGCG GGTTTTTGCT GTGCTGCTTT GTGAAGTGGA 660 

CCGATGGCCT GCTGTGCCTG CTGTAAGGAA AGGAGGCGCT CTCCTTCACG AAAGTGG t C A 720 

AAGGCATCCT CTGCCCGCAG GAAGGACGCA AATGCGAGCG CAAGAAAGGA GAATAAGCGA 780 

ACGGATGGAA GGGAGCGTAC GCkTGCGTAG GTGCACGGTG AGGATTTTTG GCACATTTGA 840 

CATTCCTTCT AGGGTGCGCT ACCATGGGCG GCATG TCCGC GTACATGGCA C TGCTGGC AG 900 

CGGCGTTCTC GTCAAGTATC GTCTTTTTGC TCGTTTTTTT GATGAGAGGT TTTTCCATCC 960 

CGCGCAGACA ACTTTTGGTG GAAAAAAGTT TTCGAGACGG CAAGTACGCG C TTGCTATC A 1020 

AGCATGCCCA TGCGGTTTTG GCTAAGGATC CCCATAACTG GGCAGTGCGT GTATTGCTCG 1080 

GTCGTGCGCA TCTtGCGGAA GGgAAGcGCG ACGTCGCGCT TATGGAGCTG CGCGcTGCCA 1140 

GCAGCAGAGC TkCGTTTCGG AAAgTGGtAr ATGAa G t Tg A gTTTcGCAAG AC T ATTGC AC 1200 

AGCTTATCTC CAGTTTGACC AATCGAnGA 1229 
(2) INFORMATION FOR SEQ ID NO: 140: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1506 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 140: 

TTGGGACTAG CGCGTACTCT TACCGTCGGG GTTGTTTCTG CTCTCGCGCG TCCCATTCAA 60 

AATAAGGGCA GTATCATCAG AAATATGATT CAGACGGACG CGGCGATCAA TCCTGGGAAT 120 

TCTGGTGGTC CGCTTCTGGA TACCCAGGGG CGCATGATAG GCATTAATAC CGTTATTTAC 180 

TCTACATCTG GAAGTTCTTC TGGTGTTGGC TTTGCGGTGC CAGTAGATAC CgCAAAGCGC 240 

ATAGTGTCCG AGCTAATTCG CTACGGCCGT GTGCGTCGCG GCAAAATCGA TGCCGAACTC 300 

GTGCAAGTCA ACGCATC TAT TGCTCACTAC GCGCAsTTaC AGTAGGCAAG GGATTGCTGG 360 

TATCACAGGT CAAGCGGGGA AGCCCCGCTG CACAGGCAGG ACTGCGCGGT GGCACGACGG 420 
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CCGTACGCTA TGGACTGGGA CGGAGAGCAG CGGTTATCTA CTTGGGGGGA GACGTCATTA 
CCGCCATCGA CAACCAGCCT GTAGCGAATC TGAGCGATTA TTACTCGGTG TTGGAGGATA 
AGAAGCCTGA CGACGAATTC GCGTTACAGT ACTCCGCGGC AGACGGCAGC ATGTGGTAGC 
CGTGCGGCTC ACAGAACGCT CAGATGAGTA GCGAGGTCGC GCGCCCCGTG CAGGCTGCCT 
TTTTCACTGG TTACTTCATG ATGCTGCGTG GTCCTCTTTC CTCCTTTTTC CCC TTCTTTT 
CCTTTCCCTT GTAGGCGGCG TTTTTATCTG TCTCTGGTGT CTTCTTTGGA TCCTGTTGCG 
TGTCAAACGT TTCAAACGAT TGCGCCATTT CCTCTAGCAG CGCGTCAAGA TTTCCCTGAT 
CGAGATACTG CAT AAC C TG T GAACATTGAG CAGGAGATAG TGCAATGCGC ACCGCAGGgC 
AGTTGTAACC ATTTGCACCC GGAATCGTGC GGTTTGCAAT GATAAAATAA GGGCGATCAT 
CAGTGATAAA CTGATACTCA AAGCGCATAG TGGGGGTGGC ATTGTGCGCA GAGCCAAGAA 
TACCCCAGGT CATCAGCGGA GTAGTTGTCC CAAAGTACGC CCGCTCACTA CTCTTTTCCC 
GCGTGAGyky TTGTGCTTCA TATTCGCCTA AGTACTTCTC AATAGcTTCG CGCAGTGCTG 
TACGGTCCTT ACGCTCAAGG TAGAGCGTAA TGCCGTCCAG TAGGAATTTG AACTGCATAA 
GCACAGTGTC GATCGGCGGG TCAAACACAA AGGTAAAGTC GCGTGGAGAA ATCGCAGTAC 
GCAAGCGGtC GACCGTGTAG GCATTCAGGA CTCCTAATTC CTTAGGAGGG TAG TC GTTCG 
AAACAGTCAT GTTCGTGCTT GAAGCACAGC AAACCAAGAC CCCCGTACCC AGCAGTGCAC 
CCAGAGCGCG AT An C CTGCG CGAC ATAAC C TGATTCCCCA CTTCCGTAAA GGnAGAGTGG 
AGGGAGAAAG CATACAAAAT CCTnAGCGTT TCCATGGGGA CGTCAGCGTA CACACAAGCG 
nTGTCA 

(2) INFORMATION FOR SEQ ID NO: 141: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 5380 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 141: 
TCAGCCTATG GTCTACGGGA ATTATGGGGG AGGCAGCTAC TCGGGTCGGT TTTTCAGTAG 
GAATATTATA ACTGGAGAGA AGAAACTTCA GGGTCAGTAT TTTGAGGAGC GGTTCGACGA 
ATGCGATGCC GAAGG cAGTG a TGT AAATGC GATAAAGCCG GCTTATCTTA AGCAGTTGCA 
GGATATTGCG TGGAAACTGG AGGATCACAG CCGAGAGATT CGGGAGGTTC GCTTTACTAT 
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CGAGGCGGGC AGTTTATGGC TTATTGAGCA AAAACCTGTC GAAGCGAAGA GCACAATCTC 
TTTGGTACGG TTGCTGCTCG ACCTGTACGA GCGCGAGGTG GTGGATGCTG AATACGTGGT 
CAAGTCGGTA AAACCGGGTC AGCTGAACGA GATTTTGCAC CCGGTCATTG ATATGACGAG 
TGTGACAGGT TTGAAATCCT CGCAGGGGGG GATTATTGGT GTTCCTGGTG CGGCGGTTGG 
GCGAGTGTAC TTTACCGCTG ATTCCCTCAT CGAGGCGTGG CGTGTGGCGA AGATGGGCGG 
ACAAGATACA CGGTGTATCT TGTGTATGCC TGCAACGTAC GCGGGGGACG TTAAGGCAAT 
TGAGGTGGCA ACTGGTGTTC TTTCTAACGA GGGGGGGTAC TCCGCCCACG CTTCGGTTGT 
TGCCCGTCAG TATGGGAAGA TCTCTTTGGT CCGTCCAGAT ATGAAGATTT ATTCGGACAA 
AGCGGTCGTT GACGGTATGA CTATCAACGA GGGCGATTTT GTAACGCTTA ATGTTCCTTA 
CTACGGGGAA TCCACCCTGT ATATGGGAGC TGCGCAgcTC ATTGAGCC TG ATCCAGAGAC 
GTCTGGCCTA GTGAGCTTCA TCGAGCTTGC GAAGGGTTTT GTGCGTTCGT TTCACGTGCG 
GGCGAACGCG GACAGTCCAC ACGATGCAGA gCTCGCGCTC GCCTTTGGTG CGCAGGGTAT 
CGGACTGTGT CGTACAGAGC ATATGTTCTT CAAAGAAGAT CGGATAAATG TGTTCCGCCG 
TATGATCTTC TCGGAGAATG CTGAGGAGCG GACGGGCAGT CTCAAGCAGT TGCAAAAGAT 
GCAGGGAGAG GATTTCTACG GCATCTTCAA GG TAATGC AG GGACATGAAG TGACTATTCG 
CCTTCTGGAT GCTCCTTTGC ACGAGTTTTT GCCGCACGGG GAGAGTGAAG TTAGCAAGTT 
TTTGGAGTAT CTCGAGAAGG TTTGTGGTAA AGGTCTGTCC CGGGAGGAGT TGCAGGAGCG 
GATCTCCATG CTATCTGAGG TGAATCCCAT GCTGGGTCAC CGTGGGTGCC GTATTGCGAT 
TTCATACCCG GAAATCTACG CCATGCAGGT GCGCGCCGTG TTCGAGGCAG TGTACCGGTT 
GCAGAAAGAG AAGATCTCGG TGTACCCAGA GATAATGATC CCCATTGTCA TGAATTGCCG 
TGAGTTAAAG CAGATTGTGT ATGGTAAAAA GATTGAGGGG CACGCATACC AGGGTATCGG 
CTCGATAGAG GAAGAGGTAC GTCTTGCGCT CAAGGCAAAG GAGGTTGACT ATAAGGTGGG 
TGCTATGATT GAGCTGCCTG CAGCTGCGTT GAGTGCAGAC GAGATTGyGC gcTACGngcA 
GTTTTTCTCG TTTGGGACTA ATGACTTGAC GCAGACAACG CTTGGACTCT CCAGAGACGA 
TTTCAATACG TTTATGCCGG ACTACACGAT GTATGATTTG GTTGACGGAA ACCCCTTTGC 
GATACTCGaT GCGCgCGTGC scgAGTTAAT TGAGGTTGCT ATGCaGCGTG GACGGCTGGC 
ACGGCCGGAT ATTCAGCTAG GTTTGTGTGG GGAGCACGGT TCACGGTCAG AAAATATTCG 
TTTTTGTATG GAAGTAGGAC TAGATTACGT TTCGTGTTCG TCTTACTCGG TGCCTATCGC 
TTTACTTGCA ATTGCACAGG CGGAGATTGA AAACGCAGAA AAGGAAGGCA GGAAGCCTGC 
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ATGGCGGGGA AGGTCTTCCG CGAAGTCAGG CGGTAGGCGC GCTAGGTAAG GTGTCGTTCG 
TGCTTGGTGA GCGTCTCTGT GCGTTCCACT AACGGTGCGA GGGATGGCTG CGTCTGCGTG 
GTTAGAGGTG TAGCTGGGTG TTTTTTGGAG G TTTTGTGT A CGCGCTGATT GAATACAAGG 
GCAAGCAGTA TAAGGTGGAA CGGGGTAGTA GTATCGTTGT AGATAATATC TCCGAAGTTG 
CGCCGGGCGG GTGCATCGAT GTGCGTGAGG TGTTGATGAT TGGTGGCGAG GGTTTGACGC 
GGATTGGTTC TCCTTATCTT GAAGGAGTGG GTGTGCGCGC GGTGGTGGGG GAATGTTTTC 
GCAGTCGGAA GATTACCGTG TACAAGTATA AGAGCAAGAA GGATT AC C AC CGAACTATCG 
GTCATCGGCA GTGGTACACT CGCTTGACCG TTAGTGACAT CTTGGGGGTG TAGGCTCTGG 
TCCGAGTGTT GCTTGAAGTC GGTGACCGAG GGCAGTTTTT ATCTGCAGTT GCCTCTGGTC 
ATGCTGCGCG TGGAACACGA GGCGGTGATG TTGTGTGTGC GGCAGTTAGT GTGCTTTTGC 
GCACTGCGGT GCTTGGGCTT GAGCGTTTGG GTCCTCAGAT AGAGGCGGCG GATCGGGGTT 
TTCTCTCCTT TCGCGTGGGG GGGTGTCCGG ATTCCGCGTT GGCTCTCTTG TGTTTCACTG 
CGGAGTTTCT AGAACGTGGT TTACG TACGT TGATGCAGGA GTATCCCAGT TCGGTGCATC 
TTTGCGTGCG GAGGGGAGTG GTGTGTGCGT AgcGTCGCGG TTAAGACAAA ACGGGGGGTA 
GTATGGCTCG AAAGAGAGGT GGCAgTGGAT CTAAGAACGG GCGCGATTCT AATCCGAAGT 
ATTTGGGAGT AAAGTTGTTC GGTGGTCAGC ACGCTCGTGC TGGTTCGATT TTGGTGCGCC 
AgCGGGGTAC CCGAATTCAC CCGGGAGAAA ATGTGGGAAG GGGGAAGGAC GATACGTTGT 
TTGCTCTTGC TCCTGGGGTT GTGACCTATC TTCAAAGGAA GGGGAGGCGC CTCGTTTCTG 
TGTGCGTGGA AAACCGGCCT TCTTGAGCTT TTATAGAGGG AACCAGGTGC CTCCTGTGCT 
TTGTGTCTGT GGTGTAAGAA GGGTCAGGGG GTATTTGCGT GTCTGTGGGA TCGTAGGGGG 
CGTGCACAGG TTTTTGAGAG AGCATGGCCA GTTTTGTTGA TGAGGTGCTG ATTCGTGTTT 
CCTCTGGTCG GGGTGGAAAT GGCTGTGTGG CGTTTCGGCG GGAAAAGTAT GTCCCgCGCG 
GCGGCCCCGC GGGGGGCGAT GGAGGGCGCG GCGGGGACGT TGTGTTCCAG GTACGGCGCA 
ACATGCGCAC GCTTGTGCAC C TGAGGTATG GACGCGTGTT TCGTGCAAAG AATGGGCAGG 
ATGGAGAGGG GGCACGCCGC TTTGGTGCAA AGGGGCACGA TTGTGTTATA CCGCTGCCTC 
CGGGTTGTCT TTTAAGGGAT GCGCAGACTC ATGAGGTTTT GCACGATTTT GGTCATGCCC 
ATGAAGGTTG CGTGACGCTC CTTTCGGGTG GAAGGGGTGG TTGGGGGAAT TATCATTTCC 
GTGGCCCAGT GCAGCAGGCT CCGCAACGCG CGCATTCTGG GCAGCCGGGG CAGGAACGTG 
TGGTGCACGT TGAACTGCGT ATTGTGGCAG ACGTTGGCTT TGTGGGGCTC CCCAACGCGG 
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GCAAATCTTC TTTGCTGAAT TTTTTTACCC ACGcGCGGTC GnTtwGcCCC TTATCCTTTC 3780 

ACTACCCGGA TTCCTTACTT GGGGGTGCTG CGTACGGGGG ATGGGCGCGA CGTGATCCTG 3840 

GCAGATGTCC CTGGGATTCT CGAACGCGCC TCGCAGGGTG TCGGCTTGGG GTTGCGCTTT 3900 

CTCAAGCACT TGaCCcGCTG TGCGGGGCTT GCATTTCTCA TTGATCTTGC AGATGAGCGT 3960 

GCGCTGCATA CATACGATTT GCTTTGCAAG GAATTGTACG CTTTCTCCCC TGTCTTTGAG 4020 

ACAAAGGCGC GCGTGCTCGT AGGTACCAAG CTTGATTTGC CGAATGCGCG TGAGTGTTTG 4080 

CAGCAGCTTc tGCACAGCAC CCATCCACTG AGGTTTGTGG AGTCTCGGTG CACAATCGCT 4140 

GGGGTTTAGA TGAATTGCAG GAGGCTTTTG TGCGTCTCTC TGACGCAGGT GCGGGCgcGT 4200 

TGCGTTCCCC TGTGTGGCGG AACCAAGCTC CCAGTTTTAT GTACGCTCAG CTTGAGGATC 4260 

CGGTGTGTCA GGTGCGTGAT GATTTTGGGG CAACGGTGAG CTTGAGCAGA AAACGAAAAG 4320 

TGCGCGGATG AAGTTAGCCC TGTTTGGCGG TTCGTACGAT CCTGTTCATC TGGGCCACTT 4380 

GCTCTTGGCT GATGCAGTAC ACCGGCACGC CGGGTATGAC CGCGTGCTGT TTGTGCCTAC 4440 

CTTCGTTTCC CCCTTCAAAG AAAAGGAAGG AAGTGCAAGT GCGCACGATC GGGTGCGGAT 4500 

GCTCCATTTA GCAATTGGGA CAACGCCGTA TTTTTCTGTT GAAGAGTGTG AGATTAGGCG 4560 

TGGGGGTATT TCGTATACTG CCGAGACGGT GCAGCATGTG CGGGAAAAGT ATGGCGCACA 4620 

GCTTGAGGGC AAGCTCGCGC TGGTATTGGG GGAAGATGCA GCGCGCAGTG TACCGCACTG 4680 

GCACGCGTTC GATTCGTGGA GTACACACGT TGATTTtGTC GTGGGTGCGC GCCCTGTGAC 4740 

GTCAgGCGAT GGGGGGAACG TAGAACGCGC CACACGCACT CTACAATCGT TTCCCTTCCC 4800 

ATGGGTTTCG GCGGAGAAtG TGGcGCTTCC TATTTCGTCA ACGTACATAC GCACCGCAAt 4860 

TCAACGGGGG CgTAgTTGGG GTTaTCTTgT ACCTtCCCCA GTTCGTGAGT ATATTATCGC 4920 

GCGTGGACTC TACCGkTCGT GAGCCTCTGC CTTCTTTCTC CGGAGTGGAC GATTCTTCCT 4980 

GTTCTTCTCG TTTTCTTTCT TTCCTTGGAC AGGCGCAGAT GCCAGCGTTC TCTTTTGTGT 5040 

CTGCAGATAT GACGGCGCTG ATTGCGCGTG TGGmAcCGTA CGCGCGCGCA GTGCTTTCGC 5100 

CACCTAGGTA TGAGCATTCG CGTCGTGTAG CGGAGTTTGG GTGCATGCTG GTACGGCGGT 5160 

ATGCGTTGGA AGCGCAGCTT GAGCCGCACG TTTATTGCGC GGGTATTGCG CACGATATGT 5220 

GCCGGGAGCA TTCGGAAGTG TTCTTGTTGC GTGCTGCcTG CGTGCGATGG TTTTCCCATT 5280 

GATGTAACTG AGCGTGGTAC GCCATTGQTC TTGCACGGGC GCGCAGCCGC GTGTGTGTTA 5340 

GCACAGGAAT TTGGCGTGCA GGATGAGGTG TTGTTGTCTG 5380 
(2) INFORMATION FOR SEQ ID NO: 142: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13954 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 142: 

CACGCTCAAC CGGGAACGAT TCGTGGTGAT TTTGCnTACG CACCACTACT AATATTGTCC 60 

ATGCTTCTGA TTCTCCCGAg AGCsCTGCAC GAGAACTAGC GCTCTACTTT TCTGCGCAAG 120 

ATTTTGTTGA ATGGCGTGAC GGGAATTACG ATTTTTTCTA AAAAGTTTAG GGTCTGCGGT 180 

GCGTGTTCTT TTTCGATACG GTGATCATGT TTCCTCTTGC TGAAGAAAGG TGAGGATCAG 240 

GGGAGGGGTT AGGAATCATG GCTTCGATTG CAaTACTCGG TGGAGGGGCA TGGGGCACGG 300 

CGCTTGCTGC GTCTCTCACC GTAAACGGTC ACACCGTAAT GCTGTGGGCC CGTCGTAGGC 360 

AGACGTGCGA TGCTATCAAT GCACGAAACG AAAATGTTCA GTATCTGCCG GGCATTACGT 420 

TGCCCGCAGC CTTGTGTGCC TCTCCCGATA TGGCATATGT CTGTGCCGGC GCGGATCTTA 480 

TTGTATTAGC GGTTCCTTCG TGCTATTTGG CTGAAGTAGC TGCGCTTATG AATACCACTC 540 

CTCGTTTTCA GAGGTTGCGT ACTGCTGCCG TAGgACAGGA ATATCCCCTT ATTGGTATTT 600 

TGACAAAAGG ATTTATTCCG GATCAGGAAG GGATGCCTCA TCTAATTACC GATGCGCTGG 660 

GTGCGTTGTT GCCGTCTGGG GCGCACGGGC AGCTCGTGTA TATTTCGGGT CCAAGCCATG 720 

CACAGGAGGT AGCGCAGGGA AAGGTGACCG GACTTATTGC AGCGAGCCAA AATCCTATGG 780 

CGGCCATTCG GGTGCGGGAA TTGCTGCGCT CGAAGAGGgT GCAGGTGTAT TCCAGTCTTG 840 

ATGTTGTTGG GGTGCAAGTG TGTGCAGCGG TAAAAAACGT GATTGCCATT GCATTTGGTC 900 

TTTTGGATGC GATGGCTGAG CATTCTGAAG CTTTTGGGGA CAATACAGAG TCGATGCTGC 960 

TCGCAGCGGG CTTGAATGAA ATTCAAACCA TTGGAAAACA GTTGGGTTCT ACACATCCTG 1020 

AAACATTCAC ATCGCTTGCA GGAATAGGAG ATTTGGATGT GACGTGTCGC AGCGCnTATG 1080 

GACGCAACCG ACGTTTCGGA CGCGACATAG TGCATAAGGG GATCCTTGAT TCCTTTTCTG 1140 

GAATACAGGA TCTCGTGAGT CGTTTGCCCG AAGTAGGGTA TCTGGCGGAA GGGGTAGTTG 1200 

CCTGTATGCA TGTGCAGCGG CTGGCTGAGC GGGATCGGTT GAAGGTTCCA ATTTGCGCGG 1260 

GACTGTACGC TATTTTAAAT CGGGAAAAGG GTGCTGACAC CTTTATGCAA GAGATTCTTG 1320 

GTTGGTAGCA CGGGGGTGTT TTCTTCCGCG CGTCTCTGTG GGGGAAGCGT AAAGAACGAG 1380 

TCTAGGAGTG GAAGTGAGAA CTCTTACCCA AATGTTATTT TTGGCCCATA ATCCGCGTTC 1440 
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GGACGGAGAG ATATATGCAT ATCATCAAGC GAAATGGCGA ACCGCAACCT TACATGCGCG 
AGAAAATAAT TGTTGCTATC AGTGCTGCTT TTAGAAGTGT CCAGAATCCT CTTGCTCCTG 
AAGTTCCTGC TATCATCACA GATCTTGCCG CGGAGGTTGA GCGACAGCTT TTTGAGATGA 
ACCGTGCGGG CGTTCCTGTT CACGTGGAAA AGATTCAGGA CTTTGTCGAA AAGACTCTTA 
CCAAATACAA TCACAGCGAT GAAGTGAAGA GTTTTATCCT GTACCGTGAC GATCGCACAA 
AAAAGCGTAT TGCAAGAGAA CAGATTGCGT GCTGTTTTAC TGACTCTTCA GTGCTCGGTG 
TACTGAAAGA AATCCAACAA GACTTTCCGT TTCCTGAGTA CAGTCTCGAT GCACTCGCCA 
GTAAGTTCCT GCTCTTTAAA AAAGAAGTTA CGGACGAGCG TCGGAGTATG CAACTGCTTA 
TTAAGGCAGC GGTGGAACTG ACTGCCCAAG AGGCTCCCCA GTGGGAGCTT AtTGcTGCGC 
GCTTGCTTAT GCTCGACTTT TCACTCGCGC TAGGAACATC TTTGGAAAAG TTAAATATTC 
ACTCCTTCTA CGAGAAAATA ACTTATCTTG AAGAGGCCGG TCTATATGGG GTGTACATCC 
GCACGCACTA TAGTCGGGCA GAAATTGAGG AAGCTGCCAC GTATCTTGAG TGTAGTCGCG 
ATAAATTGTT TACGTACAGC AGTCTGGATA TGATTCTGCG TCGCTATGTG ATCAGAACGC 
GTGCGCATGT ACCTCTTGAA ACTCCTCAGG AGATGTTTCT CGGTATTGCA CTGCATCTAG 
CGATGAATGA AACCCAAGAT CGTATGCAAT GGGTAAAACG CTTTTATACA GTCCTCAGCA 
AGTTGCAGGT TACGGTCGCA ACACCTACGC TTTCAAACGC GCGCAAACCT TTTCATCAAC 
TTTCCTCGTG TTTCGTTGAT ACGGTGCCAG ATTCGCTCGA CGGTATCTAC CGCAGCATCG 
ACAATTTTTC CCAGGTATCT AAGTTTGGGG GAGGGATGGG GCTGTACTTT GGAAAAGTGC 
GTGCGGTAGG CGCTCCCATT CGGGGGTTCC AGGGTGCTGC AGGTGGTATT CTCCGTTGGA 
TTAAGCTCGC CAATGATACT GCAGTTGCAG TAGATCAACT AGGAGTACGC CAAGGCTCGG 
TGGCAGTGTA TTTGGATGTA TGGCACAAGG ATATTCCGGA ATTTTTGCAA TTACGGACTA 
ATAATGGGGA TGACCGCATG AAAGCACATG ACGTATTTCC TGCGGTCTGT TATCCAGATT 
TGTTCTGGAA GACAGTACGC GATAATTTGG GGGCGTCGTG GTATTTAATG TGTCCGCATG 
AGATTCTTAC GGTGAAAGGC TATGCTTTGG AGGATTTTTA TGCGGAGGAA TGGGAGAAGC 
GCTACTGGGA TTGTGTAAAG GATGCGCGTA TCTCTAAGAG GACCATTCCG ATTAAGGAGT 
TGGTGCGCTT GGTGCTAAAA TCTGTGGTGG AAACCGGTAC TCCCTTTGCG TTTTACCGAG 
ATCATGCAAA CCGTGCAAAT CCCAATGGGC ATCGGGGAAT TATTTACTGT TCTAATTTGT 
GTACTGAAAT TGCGCAGAAC ATGAGCGCTA TTAATTTAGT AAGCGTAAAA ATCACCGAGG 
TTGATGGACA AAAGGTAGTG GTGCAGACAA CGCGGCCGGG GGATTTTGTT GTATGTAACC 
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TCGCGTCGTT GGTGCTGAGC AATATTGACC TTTCAGATGA TAAGGAGTTG C GCGAGG TAG 
TGCGTGTGGC GGTACGTGCA TTAGACAACG TGATCGATTT GACATATTAT CCGGTTCCCT 
ATGCACAGGT AACCAATGCG TATTATCGTG CTATTGGTTT AGGTGTTTCA GGCTACCATC 
ACGTGCTTGC CCAGCAAGGA ATCGATTGGG AAAGTGATGA ACATCTTGCA TTTGCGGACA 
GAATATTTGA GCGCATTAAC CGTGCCGCAA TTGAAGCGAG TATGACAATC GCGCGCGAGA 
AGGGTGCGTA TGGGTGTTTC ACTGGGAGCG ATTGGTGTAC CGGTGCGTAT TTTCGCAAAC 
GCGGCTATGT CTCTGAAGAC TGGCAACGTT TGCAGCGTGA GGTAGCAACA CATGGGATGC 
GCAACGGTTA CTTACTTGCA GTCGCGCCAA CTAGTTCTAC GTCTATCATT GCAGGGACCA 
CTGCGGGTGT AGATCCTATT ATGAAGCAGT ATTTCCTCGA GGAAAAGAAA GGCATGCTAA 
TGCCACGCGT AgCTCCTTCT CTTTCGCAGA AGACCTGTCC ACTGTATAAA AGTGCACACG 
CAGTGGAGCA GCGTTGGAGT ATCCGTGCTG CGGGTCTGCG GCAACGACAT ATTGACCAGG 
CACAGTCAGT GAATCTGTAC ATTACAACGG ACTTTACACT GAAGCAGGTT CTAGATTTGT 
ACGTGTATGC GTGGGAAGTA GGAATGAAGT CACTATAtAC GTACGAAGCC AGTCGCTCGA 
AATAGATTTG TGTGGGTATT GTGCCTCGTA GGAGCGTGCT TGCATAACAC TGTCCACTGT 
GTAGGCTTTC TTTGTGACGT TGCGTACGCT TCAAGCCGGT GTGGCGGTCA GTATCGCTCT 
GGATCGTGTG TGCTTTTTCT GTTATAACGG GGCGGTGGCA CACTGTgTAG TAGAAGCTGC 
CGAAGATATT TTGGACCGGC GTTTTTCTGT ATTGGATAAG GGTTTCGTGC GTTTGATAGA 
TTACCTGGGA GGGGATGCAC GCATTGTGCA GGCAGCGCGT GTTTCTTACG GTGCGGGGAC 
TAGGACTGCG CGTGACGATG CGGCGCTTAT CGATTTTCTT TTACGCAATA AGCATACGTC 
TCCTTTTGAG CAGGTGGTCC TTACCTTCCA TGTACGTGCA CCGATTTTTG TCGCGCGTCA 
GTGGATGCGG CATCGCACTG CTCGCATCAG TGAGGTGTCT AGTCGTTATT CGCTT CTTAG 
TCATGACTGT TATGTTCCGC .AGGAAACTTC AGTTGCAGTT CAGTCCACGC GTAACAAGCA 
GGGCCGCGCG TCCGAAGGTA TCTCTCCTGA ACAGCAGCAG GAAGTGCGGG CAGCGTTTGA 
AGCTCAGCAG AAAGCGGCGT GTGCCGCTTA CGACGCATTG ATTCAAAAGA ACATCGCGCG 
GGAnCTAGCG CGTATTAACG TGCCgCTTTC GCTTTACACC GAGTGGTATT GGCAGATTGA 
TTTACACAAT CTTTTTCATT TTTTGCGTTT ACGTGCGAGC GCTCATGCGC AAGCAGAGAT 
TCGTGCGTAT GCAGAGGTAA TCATTGAAAT TACCCGTGCA GTTGCGCCGT GCGCTACCGC 
CTCTTTTGAA AATCATGAAA AAGATGGGGT GCAGTTTTCA GGGCGGGAGT TTGctGCGCT 
TAAGGCCTTA CTGGCTGGAG AGGGTCTCTC CCTTGAGGGG AAGGAACGTG CGCGCTTTGA 
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AGAAAAATTA CGCTCTGGCC TGCAGCAGTA GAAGTCTATA GTGCGCTCGT CTGTGTGAGC 4980 

AGCAAGAGTA TTGCCTTTCT GTGTCTTAAA AAGGTGAATG TGGTCATAGG TATGCTGAkG 5040 

AAAAGGAGAG CGGTCAGTTA TGGGGATTGA GTACTCAGCG AGTAGCATTA CTGTATTGGA 5100 

AGGTCTTGAA GCGGTACCAA GCGTCCGGGG ATGTATATCG GCTCTACCGG TCCTAATGGA 5160 

TTGC AC CATC TGGTGTACGA GGTGGTGGAT AACTGTATCG ATGAAGCCAT GGCTGGGTAC 5220 

TGTGATCGTA TCACCGTGGT GCTCGAACAA GGAAACGTCG TGCGTGTTGA AGACAACGGG 5280 

CGAGGTATtC CTGTTGACGT GCACCCTCAT GAGGGGGTTA GTGCGCTTGA GGTTGTACTT 5340 

ACTAAGTTAC ATGCGGGGGG GAAGTTTGAC AAGAAATCGT ATAAGGTGTC GGGTGGACTC 5400 

CACGGAGTTG GAGTTTCtGT GGTCAACGCG CTGTCGTTGT GGGTAGAAGT GACAGTGTAT 5460 

CGTGATGGTG CTGAGTATTA TCAGAAGTTT AATGTGGGGA TGCCGCTTGC TCCAGTAGAG 5520 

AAGCGGGGAG TGTCGGAAAA ACGTGGcACT ATTATCCGCT GGCAGGCGGA CCCATCCATT 5580 

TTCAAAGAAA CGGTGGCCTA TG ATT TTG AC GTACTCCTGA CGCGTTTGCG TGAACTTGCT 5640 

TTTTTGAATA GCACGGTAGT TATTCAGTTG CGTGATGAGC GGTTGGCGAC CGCTAAACAG 5700 

GTTGAATTTG CGTTCGAAGG AGGTATTCGT CATTTTGTCA GTTATTTAAA CCGCGGTAAA 5760 

TCAGTTGTGC CCgAACGTCC TCTGTACATT GAGGGATCGA AGTCGGATGT TTTAGTGGAA 5820 

GTTGCGTTGC AATATCACGA TGGTTATACG GAAAACGTGC AGTCATTTGT CAATGATATT 5880 

AATACCCGTG AGGGGGGCAC GCATCTTGAA GGATTTAAGT CGGCACTTAC GCGTGTGGCG 5940 

AACGATTTTT TGAAAAAAAG TCCAAAGCTT GCAAAGAAGA TAGAAAGGGA AGAAAAGCTC 6000 

GTTGGGGAAG ATGTGCGTGC TGGATTGACA GTGGTGCTTT CTGTGAAAAT TCCTGAACCC 6060 

CAGTTTGAAG GGCAGACAAA G AC G AAgTTG GGAAACAGTG AGGTGCGGGG TATTGTTGAT 6120 

TCTTTGGTGG GGGAGCGTCT GACGCTCTAT TTTGAGCAAA ATCCAGGTGT GCTTACAAAG 6180 

ATTCTTGAAA AGAGCATTGC AGAGGCGCAG GCGCGTCTTG CAGCACGTCG TgCAAAGGAA 6240 

rcTGCGCGCA GAAAAAGTGG AATGGATAGT TTTGGGTTGC CGGGAAAGTT GGCCGACTGT 6300 
TCGCTCAAGG ATCCGGCGAA GTGCGAAgTA TATATTGTGG AAGGGGATTC TGCAGGAGGT 6360 
TCGGCGAAAA AAGGACGGGA CAGCAAGACA CAGGCCATTT TGCCTTTGTG GGGGAAGATG 6420 
CTGAACGTGG AAAAGACACG TTTGGATAAG GTC TTGCAT A ACGAAAAATT ACAGCCAATT 6480 
ATCGCAACGC TCGGTACAGG TGTTGGCAAG GATTTTGATT TAACAAGGAT TCGCTATCAT 6540 
AAAGTGATCA TCATGGCGGA TGCGGACGTG GATGGCTCTC ACATC CGT AC GCTTCTTTTA 6600 
ACGTTCTTCT TTCGATACCT GCCGCAAATA ATTGAAGCTG GTTACGTATA TCTTGCGATG 6660 
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CCGCCTTTGT ATCGCATTGC GTGGAGTAAA AAGGAACTGT 
CGTGACGAAG CGCTAGAAAG TATCGGTAAA AAAAGTGGTG 
GGTCTGGGGG AAATGGATGG CACTCAGCTT TGGGAGACAA 
AAGATGATGC aGGTGGTGCT CTCAGATGCG GTGGAAGCAG 
ATGGGTGAAG ATGTCGAACC GCGCCGTAAG TTTATTGAAG 
TTGGACGTAT GAATGTTTTG TGCATTTGTA TTTCCATGTG 
GGAAGGATCA GTGGCGTATC AAGTGACGGC AACACGGTAT 
CGTGTTGGGT CAGAAGTTTG TAGTGGCAAC ACTGCAAAAA 
TTCTCCTGCG TATTTGTTTT CCGGCCCACA TGGGTGTGGT 
CCTTGCAAAG GCATTGAATT GTGTGCAAAG AGAAGCGTCT 
GTCTTGTAGA GAGATTGCCA CCGGTACTAA TTTAAATGTT 
ACACACAGGG GTGGGCGACG TACGTCAGAT TAAGGAAGAG 
GACGCGTTAC AAGGTTTTTA TTATTGATGA GGTGCATATG 
TGCACTGTTG AAGACAATCG AAGAGCCTCC GCCGTATGTG 
GGAGGTGCAC AGGATTCCTG CAACGGTAAA AAGTCGCTGT 
GGTAGATACT CAGACGCTTG TTTGTGCGTT GGCGCAAgCT 
AGTTGAAGAC GGAGTACTGT CTTGGATTGC GCGTGAGTCA 
ATATACTTTG TTCGATCAAA CCGTGGTGTC TTGCGCAGGG 
TCAAAAAAAA CTCGGGCTAA TGACTGACGA CTCACTTGCA 
CCGCAAAGAT GCTCGCGCCG CCTTGGAATT GGTAGATGCT 
CGTTGAACAG TGCGTAATCG ACTGTGTCCG CTATGCGCGT 
GGgAATTACA AATGAGTCAC TGGTAGGAAT AGCGGCAAAC 
TACCACATGG AATGCGTCGC AGATTGAGCG GGCGCTTGGA 
CGACATTCGT TTTTCAGTAG ATCCGCGGTA TGAATTGGAG 
tGGTTGAGTG AGTATGTCTC AATTCAAGAA GTACGCGTTG 
ATACTGGACA CGCATGCAGT TCCCGGGGTG TGTTCTGCGT 
GAAACAGGTG TCGTCTCCCC ACAGGGTATA CGTCCCCCTA 
GTGCGTGCGT TACAAGATGC CTTGGTAGAA AAGTTGCGCG 
ACAGCGCTTG GTTCTTCATA TTCTTGGCGC GAGGAAGATA 



ATGTGTATAG CGACACAGAG 6720 

TCGCTGTGCA GCGTTATAAA 6780 

CTATGAATCC AGTGCGTCGC 6840 

ACCGGGTGTT TAGTACTCTC 6900 

AGAATGCAAT ATATGCGCGT 6960 

AGTGGTGCCG TCTGACAGGG 7020 

CGGCCGCAAC GTTTTCAGCA 7080 

TCTCTTGAGG AGAACAAAGT 7140 

AAGACCAGCT GTGCGCGTAT 7200 

GAACCGTGTG GAGAGTGTCC 7260 

ATCGAAATTG ACGGTGCGTC 7320 

ATTCTCTTTC CACCTCATGG 7380 

CTTTCAAACA GTGCCTTTAA 7440 

GTATTTATCT TTGCAACAAC 7500 

CAACAATTTC ATTTTCGTTT 7560 

GCCCAGCAGA TGCAGATTGC 7 620 

GCCGGTAGCA TGCGAGACGC 7 680 

CCGGTAACAC TTGAGAACAT 7740 

GCACTGTTTT CACATTGCTG 7800 

TTGGTAAGTT CTGGTGTCTC 7860 

GCACTGTTGC TTTTCACGCA 7920 

CAAGTGCCTG AGTACGTGCG 7980 

CTGTTACTGC AACTGTTTCG 8040 

CTCGCAATTT CGCGTTTAGs 8100 

CATTGGATAG TGTGCAGCAG 8160 

CTGTAGGTTC GGACGATGGG 8220 

TGTCAACATC AGTATGTACC 8280 

CGTCACACCA GATGTTGGcA 8340 

CTTCTGTGTG CATGTGCGTA 8400 
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CGCAgCATTA TGAGCGCAGg TTATCTCTCA GCACGCGTCG CTGCTCAAGG AGTATGCGTC 84 60 

AGAATTACTG GGACGGGAGG TATGCGTTCG CG T AC TTCTG GATTCGGTGC CTTCGTCAAA 8520 

AGTCGCGCCC TCCCATCTTC CTCAGAGTCC TGcCCCATCT GCTCTCTTTA CAACTTCTTg 8580 

CTTAcTCTGG GgCAGGAGTG TGATAgGGTG AtGGAGATCT GCCTGCACAG tGaAGCTTCT 8640 

CTGTGATTGT GTGCAAGGGC ACGTGGTGCG TGTGTATGAA GGTACTGCAC GGTGTGTACC 8700 

TGGGGAGGGG AAAATAGCAG GGGCTGTGCG TACTCCGTAT TGAgTAGCGA CAGACGGGTA 8760 

GGAAGAGTGC GAAAGCTGTC AGTTCAATAT GGGGATTTTG TACCAGGGGA TGTGTGATGA 8820 

TACCGGCGAT TGAAGAAGTA GTGGAGCATT TATCTCGTCT CCCGGGTATT GGAGTAAAGC .8880 

TGGCGACGCG CCTTGCCTAT C AC CTTTTGA AACGTGACCC CGCTGAAGCG CAGGTTCTTG 8940 

CGCGCGGGAT TGCGTGTTTG CATGAGCGCG TATATCGGTG TGTGTGCTGC GGTGCTTTCT 9000 

GTGAGGGGAG GACCTGCGCG TTGTGCACGG ATGCGTCTCG GGACCGAGGC ATCATTTGTG 9060 

TAGTGGAGCG TGCGCAGGAC GTGGAAATGA TGGCGGGTGT GGGGGAGTAT CGGGGTTTAT 9120 

TCCACGTGTT AGGAGGAGTC ATTGCACCGC TTGAGGGGGT CGGTCCTGAC CAGCTCCGTA 9180 

TTGCGGCGTT GCTGAAACGG TTGCAAGAGA GTTCAGTACG GGAAGTTATT CTGGCGTTGA 9240 

ATCCCACCGT GGAAGGGGAT ACCACCGCCT TGTATGTGCA AAAAATCCTT GCAAACTTTC 9300 

CGGTAATAGT AACGCGTCTA GcgTCTGGTA TCCCCGTAGG nGGGGACTTA GAATATATCG 9360 

ACCGAACGAC ATTGGCGCAC AGCCTGCGTG GGCGCCGGCC ACTTGATTGC TCGGAGGCTT 9420 

AGAGCATGAT CTCTACCGCA GTGACTACTG CCATATGTGT ACCCGGTGGA CGCTGCGGAT 9480 

AAAGAACACG CTTGCGGTGT TCAGCAAGAA GAGTCCAACG ATACAGGCGC TCCACGTGAG 9540 

TCCGTGTGTG AGCAATCGGT AACCTGCAAT CAC AAAAGGT AATGCTGCCG CCGCGATGTA 9600 

GTAATGGGAG ATTTGCAACG TGTGGCCGAG CACCCCGTAA GAAAGTGCGC TGAGGACTGC 9660 

CGTTGTCCAG AAAAACCACC GGTGCCACGC GTGACTCCCG TAGTAGAACG TGAAGTTCCT 9720 

CTGTGGGGTC ACCAGGATCC GTGTTTTTCT GTGGGCGGAs TGCGCACTTT GAGCGTTTTT 9780 

TCTTCACCAG TACTATCTAT GATAGATCCG GATTCTGAGA TCGGcTGTGC CAGCAGTATA 9840 

GCTCCCTGTG TGCTCTGTGA GGGACGATAT GCGTG T AT AG TGTGTATCGT AGTGTATCGG 9900 

CTAATAAGAA ACGCCACCGT GCACAGCACA AAGACAACGG CCCCTTCCTG TCGTGTGAGT 9960 

GTCTTTTTCG CAAAAATGCC ACTTGCAAAG AGAGCAAGCA GTGCCAGACT CCGGAAGAAC 10020 

ATGACGATTT GCATGGGAAC ATTCCCTGCG TATGTGGTAA GCGTGCCTGT GTAGAGTATG 10080 

GGGAGGAAAA GACGGACTCC TTCGAAGTTG AGTGCGCACA GAAAAAGAGA AAAATAGGAT . 10140 
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ATCTCGACCG CATGTGTTTT TTCGAAGCGA CGAAGGAGGT AAGTAAGCAA GAGTGGTGCA 
GAAAGACCCn T AC CGAGG AT TGCCACCAAG GTACTGCTAT TTGAGGGTGC GGTGGACACC 
GCAGCAAGCG AGCGAGCAGC TGCTTCCGGT AGCAAGAAAG TGAGCACATG CATGAACGCA 
CGGCCAGTGG TGTGTAACAG GGGAGGAACG CGCGCAACGG CAAGTTGTGG GAGTGAACCA 
CCAGATAGCA CGAACGCGGC CTGTAAAAGG GAAATGCTGC ACAGCAGGGC GGAAAGTACG 
ATGGAGAACG CGACGACTTT GTTTCGTCCA GCGATGGTCA C CGGGCC ATT GTAGGCGGTG 
CATGCGTGTT ATGTAAAGTT GTGCTGCGAT CAGGAGTCTT CTTAAGGAGA TTGGCGAAAG 
GTGCCGAAGC TGCGGGCGCG GAGGACGTAT GGGTAGAACA GGATGTGTGG GTAATCGATG 
CAAACGGATA GGTACTGTGC GTGCGCGTGG CGAGAAGCAA AGGAGAATAG CATGGGTGTG 
CTATCTGGCA GCGGTGTTTT TGACGTTATG TACGAAGTCT GCCATGGCGG GAGATATCGC 
GACCTTTGTG AATTTGGGAT TTTCTGCGCA CGGAAAGACA TTTGCCTTTG GACAGCACGG 
GGTAACTGAT GGGTTGTATC a GGCGTATGC GGATATCTAT GTGGTGGACG TAGAATCAAA 
CCGTTTTGTG CAGGGAGGGG TAGTGCGCAC AACGCCGACG CGAGAAACAA AAGGCAAACG 
AAGCATGGAT GTCTTTCTTG CGTTGCAGAA CCGCGCGCAA TCTCTCTTGC AACGTGCAGA 
TATTTCTGCG CTGCGTTTGG GGCGTACTCT GTACGTGCAG GCTGAGGATC GGATGGGGGA 
AGAGACGCTA CTGTTTCGAG ATTTTAAAAC GAATGTAGAA TATGTGGTGG TTATGCATGT 
AGAACGGACA ACAGAGCTGG GTGTGTCGTT CTATTTGACG GTTGAAATGA CAAGACCGAA 
TGGAAGCAAA GTTTCGCGTG TGGTGGGGCA GCGCGGCTAT GTGCGACCGG GGGTGAAAAA 
TTACGCCTTA AAAAAAGTAC TTATTAATGA GCAGCAGGAC GCTTTGATTT TTGTTGTTGA 
AAAGCACGGA TATGCCCCtG ATGGAGCATC AGTTCGGTAC ATGGTAGAGG CGTGTCGTCT 
GTAACGGTTA TGGGTTCTTG CAATCGTACG AGGGTGGGGT ATGGCTTACG TG TG AT AT AA 
TGGGCGTTAG CCTATTTTTG GTGAGGGAGC ATGTCGTCAT GTGGGATCTG CATCGTGCTT 
TATGCGCAGG GTTGTTGAGT GTGCTGCTCT ACGGCGGCTG TTCTTCTGCG CGGGACTTTG 
TACACGTTAT GAAAACTGCG GGCAATGAAG TGGCGCTTGT GCATTACGCG TTGCAAGGTG 
ATTGCATTGT GTTTGGTTTT CGCGGTGAGG TATCTGATGT AGTTGCGCGC GTATATCGCG 
CAGAGGGTGT TGTTTCAGAG GAAATGAGCG ATGAGCTGTT GCAGTCCTTT ACAGACGAAA 
TCCCCTCTGC ACTTTCACGC ATTGCGGTGC CTGCAGCGTA TCAGGAGTCT GTGGATGCAG 
CCCGTGAGCG CCTTTCGTTT TTCGCAGTGC GCTTTTTACA AAAGCCGCAC GTGGGGCAGC 
AGGTGGTATT GCGTGGTAGT GCGCAGGACG TGGCGCACCA GAAGCATGAT TTTGTGCTTC 



10200 
10260 
10320 
10380 
10440 
10500 
10560 
10620 
10680 
10740 
10800 
10860 
10920 
10980 
11040 
11100 
11160 
11220 
11280 
11340 
11400 
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11520 
11580 
11640 
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11760 
11820 
11880 
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CTTTTGAGGG GATAAATACA CAGCCTGCGC GTTTAGAAAT CAGCGAAGTC CGTCCCCTTT 
ATGGGAATAA ACGCTCGGAA TTTGTGGAAT TGCTTGTCGT GGAATCCGGT AATTTGCTGG 
GAATTACTAT CACAAATGTA GGTGGCAAGG GGAATCGCTG CGACTATCAT TTTCCTGCAG 
CCCAGGTGCG GGTAGGAGAG CGTGTGGTAT TACATTGGCG TAACAGGATC CTGCTTCGTG 
TGATGAACTG TCTGCTGAAA CGGTATCGGC AGGTTCGCAG GCCTGTGCGC GTGCACGCGA 
TTTTTGGGGG CAGGAACGCT CAATCCCAGG GCGTAACCCA AACGCAATCG TAGTGAAAGA 
ATCTGCCAAC GGAAAGATAC AGGATGCGCT GCTATTCTTT AATACCCACG TAAAAAAGGG 
AAAGGCGCCG ACGTTTCGCT GGGCTTTTCC AGAAATTGAA GCTGCCTCGC GGCTTGCATT 
GGAGCAAGGG GCGTGGCTGA GCACGCACGA GCACTTCCCT CTAAGAGAGA GTCACTTTTT 
TCAGkGCGAT TTGACGCCAG CGAAGAGTAT CGCGTTAAAG CGCAAACGGG GCGCAGGCCG 
ATCTGCGGCA GATTGTTTTG TGCTAAAAAA AGCAACGATG GGGTTGCCAA ATCAATAGCT 
CTTCAgGGTG CTGCTAACTT CAGCAAATTG ATCGTGTCGG cACCCTGcTT AGGTCTTGGC 
GCTTGCTCTG AAACCAAAAG CTTGTCGGGT TC T AC AGCGG CCGCTTTTGC GTGTC CAC AT 
CCCTTCTCCT ACGCTGa TC A TTCGAGTTCC TCAAGAATTC CTGCCATTTC TTGCCCACAG 
GATACTCCCC ATCTGTCCAA CACCTCGTTC ATCCGTCGcA GATCCTCACT TGaAAGCACG 
aAGCACCCTG cACTCCACGC ATAATTCGTG TCGCGCCCTA ACTTTTCTGA CCACCGGTCG 
TGTATTAACC ACCGACCGGT CTGATATCCT TCTGTAGATA TTTGCATCGC TTCCCGATCA 
ATCCACTCTC CATCCATATC CCATGTGCGT GTGATTGCGT GTATCCGTCC ATGAAACGCG 
CGGGCAGGCA CAAAGCAACG CACTACAAAT GCCCCCGGCG CAACCGATTC CCCATGTGCC 
ACCCCAGACC CCGGTGCCAT CCGCCCGAAA CAATAATTCG CCACGGTTTG TACCATCGCG 
C C AT ACAGCA CCCGTCCCGC TTTATCCTGT AGTTGCATTG TATCACAACT GTTGTTTGCC 
CAGTTGTTCT CGAAACTATC GCCTTTGAGT GGATC TACGC GGTAATGGAA AC TTATC CTT 
CTTCTCAATA CGTGCATCCG TACGTCCTCT CTCATCATTG TCTCTCCCCG CTGTACATCC 
GCGCGGTGCG CCGCACTGCA GACACCCTTC CTCCTGTGAG GCTGGTAATC GGCGTTGGAA 
TAAAAGGGCA GGGTATCGCC TGCTTGCATT CTGCCGCCCT TCCTGAGAAG AAGGCgCGCA 
TCGCGCAGTC ACTGACTACC CTTCCGGCAG CCTCCGGTGC ATCGTGCCTC ACCTTTTTTA 
CCCGTGGACA CATACCCCAA TTGCGCAGTT TCAAAAAGTC CGTTGAACAA TCGTTCGTCG 
TTTTCTTACA CGCAGATGTG CAACAACTAC GAACGCAAAA CATCACGTGG CTTGGATCCA 
TTCGGCGGAC CGACCACCCC CCTTGCTTCC ATTTCTTCGA TTAGGcGCGC GGCGCGATTG 



11940 
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12600 

12660 

12720 

12780 

12840 

12900 

12960 

13020 

13080 

13140 

13200 

13260 

13320 

13380 

13440 

13500 

13560 

13620 



Printed from Mimosa 02/03/22 07:30:20 Page: 790 



WO 98/59034 



789 



PCT^^13041 



TAGCCTATCT TCAATTTACG TTGCACATAC GATGTGGACG CTTTACCCGC 
ACCTGCACTG CCTGCTCGTA TAAAGGATCG CTTTCATCCA CAAAATTTCC 
GCGTCGTCAT CGTCAAAGAA AATTTCTTCA TCAAGATACT CAGGCGTTCC 
ACATGGGCGA TCACGCGCGC TAATTCTCGC TCGGAAACAT ACGCACCTTG 
GGAAAAGACT GACTCGGGTT CATGTACAGC ATATCCCCTC GTCCCAGCAA 
CCCATCTCAT CCAAAATAAT ACGGCTATCC ATTT 
(2) INFORMATION FOR SEQ ID NO: 143: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7247 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



GTATTGCACT 
AGATATACTC 
CCACGCGCGT 
AATCCGCGTA 
TTTTTCTGCG 



13680 
13740 
13800 
13860 
13920 
13954 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 143: 

AGGGTGCCGT ACACCGGCAA CCACTCTAAC AACTCCACGC CAGAAAACCC ATACGACTTC 60 

ACCCGTTCAA TGAGACTGAA AATAAGATCA GACTCAAGTA AATTGAGCTC GAGCTTTGAA 120 

GCACCCTTGA ACAAGGCCTC GCGAAAATAC AACTTCGACC AGCGCTCATC TCCCGCAAGA 180 

GCGTATGCAT CAGCAAGTTC AGCAAGAATT GCAGAAGAGT CTGGCTCCAA AGAGACTGCA 240 

AACTCAAGGC ATTCACGCGC Ag c ATCGTAA TTGCCCAATG CTTTATAGCA CAGCCCAGTC 300 

TTACGATAGA CTTCCGCGGG ATGCTCGCGC GCATCACCGG TGATAAGACT CCCATAAAAC 3 60 

CGCAATGCGT AAGTGAACAC ACAACAGCGC AATGCATACA TAACAGGTTC GACAACGCCA 42 0 

CCACGTTTTT GCATATATGC AAC GAACGCA CCCCATTGTC CAACAAGATA CTCACCCTGa 480 

CTAAGACCGC CAGAGATAGT ATGTACACGC TGCATCCGAT CCTCCCAAAA GCGAATACCG 540 

GTAAGCGCGT ACAAAACATC CCGATTGTCT AGCTCTTTTT GAAGCAAATC CTGCAAATCC 600 

TGGTGTGCCT GGCGCAGATC CCCCTGCTTC AGTAAATCCA GTGCTGACTC AAGCTCTACA 660 

CGAATCGATC GTGCAACCAT AGCGGGATAT TGTATCGGTA TCAAAGCGCC ACGTCAACGA 720 

AAGTAC TTTT TCAAAAACAA AACCAACGCT ATGTAAATAA ACACGAGCTA CTCGCAGTGT 780 

CCTTTCACTG TGACATAAAA ATCCCGCAAA CGCGCACGCA CACACTGGTA GAGCTGAGCA 840 

CTTCCCTGCT CTTTCAAATC GTCCGGCACC ATCAATTCCA GCGCATCATC AATTGTGCAT 900 

ACCGCCCACA CGTGAAAgTA CCAGCACGCA CCGCATCCTG CACACGTTCA GGCAAAAACA 960 

ATTGTGCACA ACTACTCTTT GGGATCAAAA CTCCCTGCAT TCCCGTCAGT CCGTTCAGAG 1020 
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CACAGACGTC ATAAAAGCCA CTGATCTTCT CGCTCACTCC TCCAATAGCT TGCACTTGTC 1080 

CCAACTGGTT CACACTTCCT GTTATTGCGC GGTCCTGCCG AAGAGGAAAC CGGCCAATCG 1140 

CAGAAAGCAA TACTAAAAAT TCTGCAGCAG ACGCCGAATC CCCATCAATC CCGTGATACG 1200 

ACTGTTCAAA ACAAAGCGCT GCAGAAAGAC ACAGTGGAAA ACTATCCTGC CCCTTTCCAA 1260 

GGTCATCCAC AGCTGGATCA AATGCAGCAT CTGCAAAAGC AGACAAGCAC TTCTCACGCA 1320 

ATAGCGAAGT GATGATTAAA TGCGC CTTAT CATAAATTTC CCCCGAAAGA CCTGCCTCCC 1380 

GcTCaATATT CATCACTCCT TCCTTACCCG CGGATGCCTG TGCCGTTAAC GAAATTACTA 1440 

TGCCAAAGCT ATGGCCGCAA TGTTCTTCGA TTGCCAGCGC GTTAATTCTC CCTATTCGAT 1500 

ATCCCTGTAC CTCTACCAAT AACTCTCCAC AGGCTATCAT GCGCTGAAAA CGCTCCCGCG 1560 

CGCGGGAACA CACGTACTGC CTGCGATTCA GCGCTTCCTG CACCACATGC GCAGTAATCA 1620 

CCGACACATC TGGATGCATA TCCACGGCCA CTGCATGAGA CTCCAATACT AAATCTGCAA 1680 

TTTGTACAAA AGAAGTACTG AGCCGTGTAT GACTCTCTGC CAGTTCTTCC GCATATGCTA 1740 

ACAATCGCGC GTACGCTGAG GAATCAAGTG AGAAGGTGCC ATAACGCGCC ACAACTCTAT 1800 

CAAGGTAGGC TATCAAAGCA ACTTGATTCT TATCAGAATT CGGCATGCTC GTATCAAATT 1860 

CTGCACACAC CTTAAATAGT TCTCGGAACG AGGAATCTTC CTGAGATAGA CGTTCGAAAG 192 0 

AGCATGGCTC GCCAACCAAA ACAAGCTTGC ATGTGAGCGG AACTCCTTCA GGCCGCAACA 1980 

TACCTTGGGA CTGGGAACTA CCCGCTGGGA GTAACACCTG CTTGGTACGC AGCGCACGCT 2040 

TCAAATGTGT CCATGCTTCT TCCTCCGCCA GTAGATCTTC GAGCTGCACG ATGAGTACAC 2100 

CTGCATGCGC TCGATGCAGC GCACCTGCAC GAATGCGTAA ATGCCCATTC TCCAGCGTAT 2160 

CCCCTTCATT CCCCTTGCTT TCAATCGATC CACATAAATT CGCCAAATTC GGCTGATGCT 2220 

CCGTAAACAC ATACCCCGCA TGTTCTGTGT GCACACACAC GAAATTTAAG GTATAGCGAT 2280 

CGAAAAACCG CTTTTTTACT AGTGCAGAAA TACGCATTGA CATTAAACAC TGTACCCGCA 2340 

CCTGCACATC GgTTTGTATC CGCTCTATGT ACGATATGAT GCGCGCACGC ATTTCTTCAA 2400 

ATTCATGCGG CACCTCAATG AGGCGAGAAG AGGAAGTCTT CTCCCTTTCC GTCTGTTCCG 2460 

CCGCACACGA AACAGGTGGA GGGGAAAAAA CCGGTGCATA ACACGCCACG CGTTTTTTAA 2520 

TACAGGCCAT CTGCTCGAGG ATAATTAACC GCAAACGCGC ACGGTAGTAC TGCGC AAGAC 2580 

GCCTAcGCGC GGCCCGycGC gCAGTGCGCA ACGTATGGAG CAGCGAAGAA ACCTCATCAC 2640 

AGGAAAGACG ATATCGCGCA TGGAGTTCGT GTACTACACA TCTAGAAAGC TTCGCGCGGG 2700 

AAGCTAAATC GTGCAGCGCT. TCAAAGCTAC TGTCTTTTCC CTTTAATAGG GGAACTAAAT 2760 
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CAAAGGAGTA CGTACCACGT TTCCTATGCC ATCTCACACG AAACCCACGC GTATACAACT 2820 

CCGCTTCAAT ACGTGAAAGC TCAGCACACT CACGTGTTTC AATATCAGCC AGCAATGTGC 2880 

GCCGCTCACG TAAGAAGGCA TCACTTTTCA CAATATCCTG TGCTGTATTG AGAATAGCGT 2940 

TAACCGATCT CCTCAACGCC GTAGCAAAAG GCATCCCTTC TCCCGCCGGA AACTGCAGCA 3000 

CcTGCGGTTC GTGCGGATGC ACAAAATTGT ATGCATACGC TATATCCCAC ATTTGCTCGG 3060 

GACGAGGTAC AAAATCTTTT AATAGATATT GGAGCGTAGT GCGCTTCCCG GTGCCTGACG 3120 

CGCCAATCAC ACAGATATTG TAGCCGTCAC CGTACATTCT CATCCCAAGA CGTATCGCAG .3180 

CGCAGGCACG TTCATGCGCT AGCAAAAAGG GGTGATCCTG CGCACGGGCT TTCAGATACG 3240 

CAATAGTGTC AGGAGCAACG ATACCCGTAA CTTCTTGCCA GGAAAGCTCT CGCCAGAGCG 3300 

AGCTACGGCA ATAGGGCCCC AACATCAAGT TCCCCAGCAA AGCACGGCAT GCGGATATCC 33 60 

CAATTACTCA GCACtGCGGA TGTATCTCCC CAAATATACC CACACGAGAA CCATGCACGC 3420 

AG AT AC TC GC TTGTCTGCCT AAGACAAACC GCGGATCCTG CGCTTCCTCT ACCTGATACG 3480 

GAAGTTTCAA ACAATACAAA AGTCCTGAAA CTAGGCTCGC CACCTCATTG TAACTTGCCT 3540 

CCTGCGAGGC ATTGAGGAAT CCAAGTGATT GTTGCGTGCA TGTACCGTGT TCACCATGAG 3600 

GGGAGCAAAA GGCAACCTTT CCAATTTCGA AAATACGATG TGGATACAAC GCATGTGCTG 3660 

ACTTTAGCTC AGCTGACAGA AGACACGGAA TGATAGACCT GCGCACGAAT CGGTAGCTTT 3720 

CAGTAAGAGG ATTTTCTATC TCGATTAAAT CATCTGCCGT GC ATg CAT AC GCGTGCAAAA 3780 

CTCCCTGGCA GAGCCAAGAT AATGAAAAAT CATCTCCTGA TAACCAAATC CAACTAGAAG 3840 

GTGCTTAATT TTGCGCGTCA AAAGGGTCAG ATCACTCAGT CTGCCCACTG TAAAAGAACA 3900 

TGGAACTGCG GAGAAAAACG GTCAAGCGTT CTACCCAGCA TCACTTCTTC AATCAGATCT 3960 

ACCGCATGGA GAAAATCATT ACGGTACGCT GCAGGCTCGA CGACATAGGT TCCCTGCTTA 4020 

ACTTGCACGG CACAGTCCAT CCTTGCAAAG GCTTCGGTTA TATCTGCAAC AGGCAAGGGC 4080 

ATCCCCAAAA GACGATCTAT CTGCTCGTGC GCAACCTCCC TTTTTTCTTG AAAATAAAAG 4140 

GGGGTAGTAA CAGACCTCCC CCATGGGGTG tCAAAACTAT AGGTAATTTG CACAGGTTCA 4200 

ATCTGCATAC CCATATCAGA AAGATCACAC GCAAGGCTAT TCACCGCTAC TAGCACTGCG 4260 

CACATATCCG TACCACTACA CTCGATAAAT AGTTCTGTAT CCCCGGTCAC TACTGCACCC 4320 

AGCGACGCGC TATTAGTGAT CGGAATAAGC GAAAGCACCT CACCACGAGC ATCGACTAAT 4380 

AAGGGAACGA AAGAATAATC CTTGAGGAGA TGACCATATT CACGCCCTTT CGGATGTTGC 4440 

GTCAGAATCT GCTCACCAGA CAGCGGCATG GACATACCGA GAGGCGTGAA ACTTACCTCT 4500 
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GCGAGTAGGA CAGCGCGGTA TGCGAGTGGC CAACATATAT CCTGTGCACG GTAACACCCC 
AGCGAAAGGG TACACCTCCT ACGTCCAAAA TTACTGGCGA GTCTTTCCTG TGTTTGTATT 
AGATCCCACA GTCTTACTTC GCTCAGTCCA GTACCCCGAG CCACAAAGCC AGCAATAAAA 
GGGCGTATAG TTTGCAAACG CGCATCCACT ACTACGCATC GCCCATACGA GTCGCGCACC 
GTACAGCGAT CTGAAAGAAA TGCACGATAC TGCGCTGCGC gsGAAgCCCC CCCTGCGTGA 
ACACGCAAGA GGCGCGCAAG CCCTGCTGTA GACCACAAAT CCGGACGATT TGTGTCGTTA 
AGTTCGATTT TTAAAAGACG CGCATCAAAC GCACCCTCAC GTACACAGCT GGcCTGaCCC 
CTCGTCCTCT GcACAGGaTy GcACACCTcC CCTcCTGGcG AAAGGCTCCC TCCCTcTTTA 
TCTTCCCTCA CGGAAAAAGA GCTGCACTCC AATTCAGCCT TAGCAtTCGC AAACGTCGCT 
CCAACGTATC ATCGTCGCAC CGTTCACCGA GCAGCTCAAA AAACAACGCC TCATGAATGC 
TAATTTTCGG CATGAGCCCT CCGACCGTGC TCAGTATACG GTCAAAAAAA GGACACCACA 
AGGGCCAGCC ACCACACCGT CTTTGcAGAG GATGCCACTA CACGGCACAC GTGCTCTACA 
CTTTAGTCAT AATACGCGTA ACACTCTTAA ACTGAGCGCT CATATGCGCA CGGCATAAGT 
ACCGTGCGCm CAAAGGCATG AGCTCACTGA CGACCGAAAC GAATACTCCG ATACTTTTCC 
GGAAATTCCT GGTCTGTTAC CGGTAAAGAA AACTTACCCA AAATATGTAT GTCAGAGAAC 
ACTGACACAT GGTTGGATAC TTCCTTCCCT GTGAGAAGAT CCGTGTACGG CGCAACCCTC 
GATCGTGTTC CTTTGGAAAA CAAACTAAAG GCATCAGAGA GAGCTACACG CATATCCTTT 
TCTCGGAGTT CATTTTGCAA TATCCTGCGC GCATATTCAG TTAACCCAAG TGTGCTGACA 
AAAATATGCG AACTTACATT CACCCCGAAA CGACCTACCA ACCTCTTCTC TTCTACCGCC 
GaCTCGACGC GcTCCAGGAT CCCCGACACA TCCTCATCGT CAGCATACTC CACCCCGAGT 
GCTTCAGCAC AGTCAAACAC ATTCGCCCCC AGAAGCATGC CATCTTTGCT TACCAGTTCT 
CTCATCAGGG AAACACGGTG CGTATCATTA GTGCAAAAGA ACATAGTGTT CTTCCCATAC 
TTACCGATCC ATTGCTTCGT GTGTGTGCCG ATAAACGCAC GCGCCCCTTC AACAC C TG AC 
TyCCTGCGGG ATCTGGAGCC TCCTCAGACG CAAACTCTAT ACCGATATCT GCACACACTG 
CCTCCATTAC CTGACGCCTC GTCCTAAGCT CAGCCTTTGA AAGGTGACGA GGAAAAGAAA 
CGTGCACCAA CGTACGCGCG CCCATTTTCT TTGCAGCCCA GGGAATAGCA TACCCACCAA 
ATGCATAATC ATCACTAACC ACGATATCAG CAGATTTTTG GATAAGCTCA GGATACTCAT 
GCGGATCCCC TACAAAGAGC ATGATGTCGG AACG t TCGCC CTAATTTTGT TAAAAGCCTC 
TGCAGTTCCA TGAATTCCTT CGCTGACAAC AATAGCACCC ATGAGAGGAT CATCAGCAAA 
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TGCAAGGAGC CGGGACACCG TTTCATCTTG CCGATTTACA AAATCTTCAG GATACGTGAC 6300 

ATGCAAGATA CATCCTCGGT GACCGTCACC TACCGCACCG TACATGGCGC ATGCTCTTTT 63 60 

TGCCGCAACT ACATCATGGC TAGAGTGACT ACCCGGACCG GTCATAACCC CGATGTGATA 6420 

ATCCTCCACC GCTTTCGTGC CCCCGGTATT TTGTGGAGCA CGCGCGCCCC CACAGCGACA 6480 

CAACAGTGCA CCGACACATA CCACCCATAC CTTCAATAGA AACTGACTTT TCATAGTCTC 6540 

CCCTTAACGA TCTGCACACA CCATCTCTCC AAAACGCTTA GGCGTATGGT CCACCCCCCC 6600 

CCCACGGGAA AATCAATTGT GTATTTATGA TACAGCATCA AAAAATCATC GTGTGCAAAC 6660 

CAAAAAGGTA TCCCTTCCCT CCCCCTTATT GAAGGCTCCC TCACCACAGA ACCTTACACG 6720 

AACACACGAG cTyG t CCGCT TATCCGGCCA AGAACCATGC TCCAAACTTA TGTAGGGACG 6780 

ACGCGCAAGC ATTCCCTGCG ACCACTCAAT GAGCAACTCA CATCTTTCAA ACGGACTCAC 6840 

CGCCACCCCT ACCTGATAAT ACAGATTTTT ACCTGCGCGG AACGTGCGGT TCCTCTTTAn 6900 

CGTTCCGGCA GCACCACCTG GTCCGTAAGA AAAACCcTGC GCACCATATG TAGCGCCATA 6960 

CAGCGAAACC ATCGGTCTTA TCCATGCGTG AGcAGnCAAG AGTAATCCGG TAG TT AAGCC 7020 

ACAGTTTCCC CACAACAGGC AACGCGCTGT GTGCACCCGG ATGGACAACA CCGGGTGGAG 7080 

GAAGCACAAA CGCCTGAGGC TTCTTTCCCC GTGTGGAATT CGGCGTACAG GAGCCAGGTT 7140 

CCAGGTACAA GCCATGAGTG AAAGGAATAT ATAAAcGCGC CTcGAcACyT CCGCTCAGAC 7200 

CGGACACAAg CTTAGAAAGA GAATCGTACC ACTTACTTTC CACCCCG 7247 
(2) INFORMATION FOR SEQ ID NO: 144: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2898 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 144: 

AACCGnTTCC CGTTGTGCTT CTGCAATTTT TCTGCTGTGC TCGCTGCTCC TGAACTGGCC 60 

TTCACGCGCG AGGCTCTCGG CCTGTTCGGT TGCCTTCTGC GCAGTGTGAG CCGCCTGCTG 120 

AAGCTTCCCT GTGCGCACGC TCGGCGAACG CGCTCCTTCT CTTTTGCTTC CTGCGTACGC 180 

GnCCGGGTCC GGTGAAGTTC TGCGCCGCCT GCTGAGCTTG CTGAGCTTCT CGGCGTGTCT 240 

TGTCCGCGTG TGCCTGGGCG CGCTGAGCCT GAgTGGAaCA cTCCGCCGCG CGCTCCTGCG 300 

CCTGCTGTGA CTGCTGTGCT AGTTGTTGGG CCCGGCGCGT TCCTTCCTGC GCCTCCCGCT 360 



Printed from Mimosa 02/03/22 07:30:25 Page: 795 



WO 98/59034 




794 



CTTTAAGGTC AATCATGTcC CTGCGCAGAT CAACGTTCTT GTCCGGATCT TCACGCAGCC 
GCTCCACCAC CCGC TTGTC A GAAATGGCGC CGGTGTCCAC TGCGCTGAGT GCGTCAGAGT . 
GCCCGGAAAG GGGAATGACG AGCTGCGTTT TTCCCGGCCA CTGATCGAAA CGGCGCGCAA 
TACCGACGTG CTCGGGGGAG AGATGCGCCG TTACCACCCC CTTGTAGCGC GCCTTGAATG 
TGGGAAGGTC TGCACGATAC ACCGCGTTGT ACACCGTTGT GAAGGTTGCT ATCGTGCGTG 
CGTCCTGCAG GCGATAGCaT AGGCGGCAGA AAGATAGGCG GCGACGATCT CGCGTATGTT 
GTCGATATGG TCCACCCGTG CCTGTGCGCC AATGATGAGG ATGTCTGCAT CAAAGCCATT 
GGTCGTGTGG GGACCTAcTG CGTGAATAAG cgCGTAACGC gcACGATCCC CCGCAACACC 
GCCGCGCAGT GCAGAGGCCA AGCCCTCCCC GATGCGCCTG ATAGCGGCGG CGCTATCTAC 
GTCCGTGTGG GTTCCTGCAA AATTTTCAAA CTCTACCGTG GCGTTTGCAC GCTCAAGCTC 
ACGGCGATCC ACCTCGAGTG CAAAAAGACA GCCTGCGCCC AACACGGCAC ACATGGGCAG 
CACGTTCTTC ATGACCCTCT CCCCCTTCTA CACCTTTGTT TTTGAATACC GCACGCCTAG 
ACACCTGAGA TCCCAACCTT CCTGCGTCAT CAAAGCCCCA TCAGCGTACA TnACTCTGAA 
CGAGAGTTCT TGCATAGGCC CGTATGGTGT ACGGGAAAGC CCCCGTGGAG ATGCGGGAGA 
GTTCTGAGAC TGCCTTTTCA GAAAGGGTGC GCGGTGCACT GCGCGCGCAT GCAGCGAGTG 
CATCGCACGC TGC ACGCATC AGCGACTCGG CGCGTGCGTC CTTAGCACGC GTGACTCGGT 
ACAGCGCAAG TAGTGCCCGC TCATCGAGGC CGAACGTGCA GCGGGTGAGC CCTTTGAGGA 
TACCTTCGAG CACGGTGTAG TCCTGcTCTT TGAGAAGCTC AAGGAGCGCG TCTCGGTATT 
CGTGTGCGCC TAAGGCTCCT AACAGCGTTG CTGCCTGCGC TCTCCTGGCA GAGGTCACTG 
GAGGCGCCCG TGCGCGGGgT GATGCGGTGA AGTAAGAAGC GCGTGCAACG AGCGGGGCAT 
AGGCCGGTTC TCGAGGTCmT ACATTcCTTC CTTCAGGGAT GTTTGCACTG CCTGAAGGAG 
AAGCTCAGGG GTGGCGTACG GTCCGGGCGG CGACGTTGTG TGCGTCTCTT GCGCGTTGTC 
TGTCTTTCCG CTGTTTTTTG CAGGAgCGGG GGAAGAGAAG AACGGGTCGG AaGAgATTGA 
GTCATAAGGC GACCGCTTAG CAGGGGCAGC AGGGCGAGAA GGGGGAGCCT CTTGCACGTG 
TGCACGGCGT GAAGAACCGA GCACGTGAAA ATACTATCGC GGTTGTATAT GGGCGTGTGG 
TCAATGAGTC TTGTTTCTGC ACGGTATGCG TTGAGTACCC AACCGCCTGC AGATACTACG 
AGCCCGTCCG TGGTGATAAG CGGAGTAAAC GGTGTTTCGT ACAGCGAAAT GTTCCACTTC 
AGCGCGCCTC CTGCGCTGAG GGTAATCGCC CGCGCTGTAT CGCTGAGCAt ATTTCGTCCC 
CCTGTACGCG GCACTGGGCA GCCGAGGAAA AGGTACACGG AAGTGTGAGC GAGAAAAGCT 
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CATTTTCCCC ATCTGCGTCG 
TGTTTTCAGT AGCAAAAAGA 
GCGCCTGGTG nCTGTTCGCG 
TGTCAGTGCG CACGCGCCAC 
TCGTCAGCGC ACGTACGGAC 
ACCAGTACTC CACGGTTCCT 
CGGAGGAAGG AGGGGCTTTG 
GAAGCACAGC AGGGCGTGTG 
CGCGGTGCAC AGCCCCCCGT 
AGGCAGGACA GCAAATTCTC 
GCTAACCTCG CTTACGTACA 
TACGGGGCAG TTTCTGCGCC 
ATCTGCGCTA AATGTATAGA 
TGCAAGGACC ACACTCCA 
(2) INFORMATION FOR SEQ ID NO: 145: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3956 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 145: 

CACGATGTAC TGCATTCGGn GGCATGCCCA GGACTCGGnG GATGCGCGCG TCAGAGGTGA 60 

TAGCGGATAT CTACCCCGGC GGTGACAAAC TGCGCATGGC GCGTGTTAGT CTGcTCtCTC 120 

TCtTCTTCAA CCACTTTTTC GCACGACCGG GGTACGCCGC TATATGCAGC ACTGGCACCC 180 

AAGGACCAGC GCTCTGTCAG CTGAAGgTAA CAGCCGGCCC CTGCTTTGAG CAAAAGACCG 240 

TAGTACGTAG ACGTGTAGTA GTGCTGGTAG CTGAAGCCGG CCCCCACCGT CAGCGGCAGA 300 

CGGATGCGCC AGAAGGCGAG CGTGTAACTT GCATTCAGCG TTACAGGAAC GGCAAGgTAG 360 

GAACACAGCC GCTCCTTCTC GTAATAGAAC GGATAACTGC AGGAGTGCTG CGCGCTCGCA 420 

TCAATCCCGA GCGACAAGCC GCGATACACA AAATACTCGT ATCCAAAAGA CACCCCAAAC 480 

GCGGGGTAAA CTTC cTTATT CCCGTTGGCT GCAACGCCCC CAGATCCCCG CGGGCGGTAT 540 
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TAGGCACGCA CCGCACCGTC TGCAAAAAGG GCGATGAGCA 2160 

GCGACACAAG AGCGCTGTGT GCTGATGCGC ACCGGCACCG 2220 

GTCTTTCCCG CCTGAGTCTT ATCAGGCGTA CTGTTTAGCG 2280 

AGGCCCTGAT CCGTGGCAAC GACGTACCCG CGTGGGGTGC 2340 

CCTGATGCgc GCAGGGCGCC TTTTTTTTCT CCAAATATGG 2400 

GCGGGCCCGT GATGAAAAAT TCGCGCGTTT TTGCATCTGT 2460 

CGCCAAGCTT CTGACTCCAC AGGCGCTGGC CGGTACTGCT 2520 

CgGTGGTGTA CAGCAGCCGT CCGTCTCTTA AAATTACAGG 2580 

TCGCCGCGTT CACCGCACGC TTTGCACCGC GCGTGGCAGC 2 640 

GGACGGGTAT GCCCTGGCTG GAGAAAACCC ATACGCGTCC 2700 

CGAGTCCATC GTCAGAAACG CTGAGAAAGG GAGTGCCGAC 2760 

ACACGAAGGA TCCTTTGGAG TTTACGCAGT TGAGGGCGCG 2 820 

TGAGCGAATC GTAACGCACC GGCTGTGTTA CGTTCTGCCC 2 880 

2898 
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TGCACCAGTC CACCTGGAAA AGGGGCACCG CGCCAATAAA CGAAACGCGC GCGATCCCCC 
GCCCCGCGnC GGTTaCATCC CACGATGCGG CAGGCGCCAC CTCCTGCGCA TACGCTGCAC 
CCGCGGcAAC ACCCACCCCG CAGcGcAGAC CTGCAGAAAG CGTATCCCGT GTGCCGCGCA 
CTGTTTTAGC TGTTTCATAC GTTTTTTCGC TCCACACCTT TCAAAAAGCT GCGGGCAACC 
CACACAGCCT TCCAAAATTC TACCCCCCCC CGGCCAACAT TTGTCGAGTT CTTTTTTTGC 
AGGAGGGTCG CGCCCCGCGC ACCACCGCTG cATACCCAGT GCGCGCCTGT AACATTCTGA 
CCGGGGAGGT GTTTTCTCAC ACCGGGGAGG TTTTATGTGC AAACCGCGCG TGTGGCGCAT 
CGCCCACACC ATCGTCCATG TAGGCGCGTT GCTGCTCGGC ACCAGCCAGC TGACAACCTG 
TGATTTCTCC GGCATTTTTG CCACCATTCA GCAGGAAGTT GCCATTAAGT CGCCGTCTAT 
TCCGGGGGCG ATTTATGGCC TGGTCAAGGC CGGGGATAAG CTCTATGCCA CCAACGGTCG 
GCTTTGGGAA AAGGAGCTGA ACGGCATTAA GTGGAAGCCG GTGCCTTTTC TTGACGGCCA 
AGATAAGCGA ATTGATAGCC TTGCAGCCAG CAACACGTGC GTATTTGCCT GTGTTTCAGG 
AGACGGTGTG TACAAATACA CCGCCGGCAC CACCTCTTCG CAGAAGGAGA GTAATACGGA 
TAAAGCGCAG GCGGTGGTAC AG ATG TCGG A CGGAAAAGTG GTCCTGCAGT GTGCCTTGGG 
GGATGAAAAG ACGACCCCGA GCGACGCAGA CGAAAGGTTG CTGGGGGGCG GCCAGGGCTA 
CCTCGTCACA TCCAAGGGAT TTTACACCCT CCCAGGGTCA GCCTCCTGCG AGGTTATCTC 
CGAAACAAAG GACGTCACCT GTAAGGCAGA GGCGCCGATC CTCGCCAGCG CCTGCGATGG 
CAGCAATACC TATATCCTTA CCAAGGACAA GGTGTACTGC CGGTATACGA ACGGCTCAGG 
GAGCACCCCC a CTACGTGGT GCGACGTGGA ACACAAGGTA TCAGAGCCgc TTGCGCTTGC 
AGTGTTCAAA AATAAGGGTG AGACGTTCTT GCTCGTTGGG GGACAGCAGG GATACGGGGA 
AATAAAAATA GCCACGGCAA GCGGCAGcTC CTCTTCTTCC TCATGCGTTC cCCTCACGGC 
GGAAAACGTG CACGCCACCA CCGGGTGGGG CGCCAACTGC TCCACCCCGG AAGGCAGCGC 
CGAGCAGTAT CGTAGTACGA TCGGCCGCTG GGCAGTGAGC GGTATTTACG TAATCAAAAA 
AGACACTAGC GGTGGGCGGA AAAAGCGGAg CACCTCAACA GACTGCGAAA GACCAGACCT 
CTACGTGGCG GTGGGGGATG CGAGCGACAC CTACACCGGG CTCTGGAAGT TCGACACCGC 
TACGAATACC TGGAATCGCG AGTGATGGCG CGCAgCAGAT GcGTGCACCG CGTGGTGCAC 
CAGGCAGCGT GCATCGGGGT GATAGGCCTG AGCACCAGCG CGCTGACCAC GTGCGATTTC 
ACTGGCATCT TTGTGGCCAT CCAGTCGGAA GTGCCCATTA AAACGCCGTC TATTCCGGGG 
GCGATTTATG GCCTGGTCAA GGCCGGGAGC AAGCTCTATG CCACCAACGG CCAGCTTTGG 
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AAAAAGAACG TAGCAGAAGA AGGTAAAGAC TGGGAGCGGG AGTCCTGTTT CGACTCGGTG 2340 

ATAGGCGACA GCCGCATCAC GAgcTTGCGG CAGACAACGG CGAGAATGGC GTGCTCGTTG 2400 

CCTGCATTCT TGGCAAGGGG GCGTACAAGT GGTCGCAGGG TAGCGCCGAC CAGACAAGCG 2460 

GAAATCCGTC TGCCCTGAGT GGCACAGAAA AAGCACTCAG CGTGGTAGGG ACCGGGACAT 2520 

CATGCGTGTA CCTTAACCAC ACGGATGATA AGGTTGGGGA AACCAGTAGT TCGGAAAGTG 2580 

GTGG AaTGc T GCGTCAGGAG AAACGAATGA GTTCTGCCTG CACGCCGGTA ACGGtTTTTA 2640 

GTTACCACCA AAAAGGTGTG TGTCGGTAGT GATGGTTCTC CCGTGGCAAA GAGTGATGGC 2700 

GAAGAACCAG TTCCGCCGAT TCTTGCGGCA AC TG ACGACG GGAGCGGGCA CGTTTATATC 27 60 

CTCACGAAAG ACAAGGTGTA CTGCAAAAAA GTTAATCAAA GCGAAGGGAA AATTCAGGAT 2820 

TGCCCACAGT CTGCCGCAGC AGCGCCGGAG CCAACCGGGG CACACAGTGT TGCCCATAAG 2880 

GTAGCAGACG CGCACTCCAT AGCGTTCTTC AAAAACGGCA GCGACGAGTT CTTGCTCATC 2940 

GGGGGCCGGC AGGcTACGGA GAGATAAAGC TGGAAAGAGG TTCAGGAAGC AACGGGAACG 3000 

GAGCACAGTG CGTCCACCTG AAGGAAGAGA ATGTACACGA TCAAACCGGC TGGCATGAGA 3060 

AgGGCTCCAC CCCGAAAGGC AGCGCCGAGC AGTATCGTAG TACGATCGGC CGCTGGGCAG 3120 

TGAGCGGTAT TTACGTAATC AAAAAAAGCA CTAGCGGTGG GCGGGGAAAG CGGAGCACCT 3180 

CAACAGACTG CGAAAGACCA GACCTCTACG TGGCGGTGGG AGACACGAAC GATACGTACA 3240 

CCGGGCTGTG GAGGTTTGAC TCCGCCGCGC AAAAGTGGAA CCGCGAATGA GTGGCTCTAA 3300 

CCtACGCTTC CCCTACTCCC GCGCGTACGG CGGGGCAGGC GTAGGGCGTA ATTTTGAAAA 33 60 

ATCGAGCAGA TTCTCAGTAC AAAAAGAGGG TATAGGTGCG CCGGGTTCtG cGCnGTACCG 3420 

CTTTGCAGTT CAAATTGTTT TTGCTTTCCC GCCTCTTTTT TATTTTTACG TCACATATTC 3480 

CCtAGACGGG TGGGGGGGGG TGAGGTAGAA GTGAGAGGAG GGGGAGTGAG TGGGCAGGCA 3540 

GGTGATGCAA GCGGGGkyAC TTGCGGGCAT GGTATGTGCT GCTTCTGGTT ATGCAGGCGT 3600 

ACTCACTCCG CAGgTCAGTG GCACAGCCCA GCTCCAGTGG GGCATTGCGT TCCAGAAGAA 3660 

TCCACGCACT GGCCCGGGCA AGCACACCCA TGGGTTTCGC ACTACCAATA GTCTGACTAT 3720 

TTCCCTGCCG TTGGTGTCAA AGCACACCCA CACCCGCCGA GGGGAGGCAC GCTCAGGGGT 3780 

GTGGGCACAG CTGCAGCTGA AGGACCTGGC AGTAGAGCTT GCGTCTTC T A AAAGCTCAAC 3840 

GGnCCTGTCC TTTACCAAAn CTACCGCTTC CTTCCAGGCA ACCCTGCACT GTTATGGGGC 3900 

CTACCTGACA GTGGGTAnAG TCCTTCCTGT GTGGTTAACT TTGCCCAGCT GTGGAA 3956 
(2) INFORMATION FOR SEQ ID NO: 146: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1314 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 146: 

GGAAAGCACT GCGTACTAGT GAAGCTAACG CGGAGACTCC CGCGAAAGTC AAGAAAAAGC 60 

GCGCCTTTGC TCGCTGGGAT ACCAGAGAGG ATAGCTTGGC GGGCGAAACC TTTTTTTTCC 120 

GCTTCTGGAT TGGGGTAAAG TCCTTTTTTC GGTCTTCGTC GGTGGAAGCG TTGTACGAGG 180 

AAACACTTCT CCAGCGTCTG GGCAATGATC TT AAGGCGC A TTACGCGCAG TACATTGACG 240 

TAAAGGAAAA AACCTTCACA AAGGTTTTTT ACGACAAGAT GGGCGAGTTG CGCAAAACAC 3 00 

AGGTATTTTT TGATTCATTG CTCGCCTGCT ACAATAGCGA TAAGGGGGAC TTCTACCTCC 3 60 

TGTTGAGCTC TTTTATCACT CCCGTTGTGT ATGAGCAGCT GATGGCGTGC AGCGATCCGT 420 

TTGCTGCGCA GGGAGATGGT ACTCCTTCTG GTTTGCGTGC GTCCCTCCTT AAAAGAATGG 480 

ACGTCGCCCT TGCCACCCTG AGTGGTCCTC ATAAAACTGA GTTGTATCAG GCGGCGCGCG 540 

CTATCGAGTG GATGAAGGTT TTTTGTGAAG TGCCTGTCGA TCGTATTCTG CTGCGTTTTA 600 

CCGTCATCTC CCCGGCGAGT GCCGTGTGTC CCATTACTAT TTTGCAATCT GAATTGGAAA 660 

AGCTTGCGTG TGTTATCCAT GACAGCAAGC ATATTCCCGA CGCGGTATTG CAGGGGCTTT 720 

TTGTGCTGAA GAGTAAAACA TCGCTGCATG ACGCGCAGGT GGATAACGCT GCGCACGCTG 780 

CTGCCTTTCT TAAGGAGGCG AGTGCAGCGC TCGTTGTTAT CAAGGATTTA TCGCACAGCA 840 

TTCCTATTGA GGATTTTGTC CGTTTTGCAG GTAGGAACAT TCGTTGGCAA CCTCGGGCAA 900 

TTGCCGGTGG TGAGGATTGG TTTGTCCTTT TTAAGAAAGC GTGGAAAAAA CGTTTCAATG 960 

AAAAATGGGC GCTGTGGTCT ACCGCACAGA AGCGTTTAGT GCTTAAGGAG CAAATGCTTT 1020 

CTCTTC TCGG AAGGGAGCAG TTCTCAGAGT TGACTCATCG GCCATGGGAA GGGTTTTGGT 1080 

ATCAGCTTGT GTTTAGGCGA GAGATGTCTT TTGTCTTTTT AAAAAATTTG TTCGAAGgTG 1140 

CGTATGCGCG CGTCGTTTCA CCGCCGCTGA ACGTTATTCT TGCTGAGGGC AGTTTCTATC 1200 

GTCGAGATGA GTTGATAGAG TACACTGACG CGGTAAATGT ACTGGAGCAG ATGGGAGCAA 1260 

AGATTAGGAA TTTCGAGGTA AGACTTTCGC CGGTGGGGGA ATGGGGAGTC GCCT 1314 

(2) INFORMATION FOR SEQ ID NO: 147: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1058 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 147: 
GTCGAGTATT TCGTTTGAGT AGGTAATTCC TCCTTGTGTG TTGTCCTGTC TTGTCGGGGT 
GCGAAACATC AGATTACAGA ACTGTGTTTC AAGCGGCGAT ATGCGCAGTA TTTTCTGTTC 
GAACTGTTTA GAAAGATATG TCCCTCGTGC ATATGGCTTT CCGTTGGTGT AGGAAAAGTC 
TGTTCCAACA GAAGTGATAC GGGTAAACCC TAGAAAGGAA GCCACCGAAT AGGCGGCGCC 
AGCAACCGTT nCCCGAACTA GTTTCTAAAA AAGGTAGGGA AGAAAAACGA CTTGCGTATA 
CACTAAAAGG GTGACCTCCT CCGGTGAAAA TAAGTGAATT ACCATTCAAA TAAAAACTGC 
GTACTGCGCA TGGACACGCG CATATGTCGA ATATTGCAGT TACCTGTGAA GGAATGAACA 
CATAGTGTGT GGCATACGAG ATGTATTGCG GATCAATGCT GATAAAAAAA TCTGGCATAA 
GACCGCCTGT GCAaCACACG GGAAAAGCAG TATCGCATGC AAAAATAGTG TACACATCCC 
TATGTGAACG TATTTTTTCA ATACCTTTTT CAAGACTTGG CCCTGCACCT AATATGATCG 
CCTCAGTTTG TGTATTGATG TTCGGTAGCT TTGGTGTGAA TATCTGTGCA TATCTTAGAT 
TGAGTAATGC GTTTCTCATC CAAAGCTTTC CAAAGTGCAC TTGGACTGAA AAGTCGGCAC 
GAATGATTTT CATTGCTTGG CTGGTAAGCT CAGCGATCTT TTGTTCGCTT GTGGGAAAAA 
ATGCCTTCCA GGGTTGTAGA TAGTGAACAA AAAAATTGCC GTGGATGATG GGCATATAAC 
ACTGTGCAAT TTCTTGTGAT GCACTACCGT GGGTGAGCGG ATGAAGAAAG TGGACGCGTT 
CTTGGACGAT AATATGCGTG AGGTCCACCT GGCTCCAGGA GTGCACGGAA ACTCGCGTAA 
TCAAATTCAC ATACCGCGCA cATGCGTGGA CGAAATTTTT CTAAGAAAAC ACTGATGTGG 
GATACCTGCC CCGATGCCAC AAAAAAGAAT GGAAGCAT 
(2) INFORMATION FOR SEQ ID NO: 148: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1145 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 148: 
TGACCGGCCA GCAAAAGGCT CATGCTCAGG CCGGCnCCGT TGCAGGACCG GGCTCGTACC 
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CCGnTGGGTG CCGGTTTATC CCCCGACCAG GGTTTTTTGG ATGAAGTCTG CCGCTGGCCA 120 

TTGCGTGTTG ATGGACAgTT TCtGGCGATT GGCGAGCCtT TTTTACGCAA GArTgCgGTG 180 

CaCGATGTGA CAGCGTTCaT TGATGCrCAC AATGCgCGCA AAAGGGCGCC gCGTGCCCCg 240 

CTGTTGGTTC CgGAcTTTTA GCTGTCaTGG GGAAAgCGAA TTTATATGCg CGTATTCGTG 300 

TCTTGGCATA AGCaAAAGGA TAAGGGGGaG CGCGAAcGCg CAGGTGCGCT GCaGGGAGCG 360 

TGCATAGGTA CGCCTGAGCA CCGCCAGtTT TTGCGCTGGT AGGAAGAAGT GGCAGCGGAA 420 

AGAGTTTTCG TGCGCAGATA GTGATGCGGC GTTTTGATAT AGCGTTCGTT ATAGACGATG 480 

GATTGCTTAT TCGTGAGGAT AAGATTATTG CAGtCGCTCG GCAAAGCAGG AAAGGACGCT 540 

GCTtGCGGCC ATACGGGTTG CGCTCTTCGA GAACACGATG CACCGTCGCG TAtTGCCCGT 600 

GTGTTGACGT ATTTTCTGTG TGGGTCACAG AACAAAAAGG TGCTCATTCT GGGGACTTCA 660 

GAGAGGATGG TGCGCAGAAT AGCCCTTCGT GTGGGGCTTC CTGAACCTGG GCATATCATC 720 

CGCATTGAGG ATATTGCCTC GAGCGAAGAA ATTGCCtGCG CgCgCaCACG GCACGTGaGG 7 80 

GCACGCACGT CATTCCAGTT CCCTCTGGGG AGGTACGCAA GAGCTATCCT AAGATCTTTT 840 

ATGAAAGGAT AAAACTCTTG cTGCgTAGAG AGGCAGGTGC GGAACGAATa GGACGGTGGG 900 

CGCACGCCAT ATGGCATGAG GGGCTCAAGC GTGcGTGCAG CGGCGCGCAC cGCATGtATT 960 

TGAAAAATCA ATAGTGCGTC CACCTTTCTC ATGCAATCTG CGCGTGGTGG CGGCGACAGA 1020 

GGGCACACAG GATGCGTCGC CTGTTGTGGT GCCGCCTCAC GAGGAGCTCG CTCTCCAGCA 1080 

GTAGGAGAGC GATGACAGGA GAGGCAGTAG TTTGTCCCGT AGCGTGTGGA GCCGGAnGGT 1140 

CGAAG 1145 
(2) INFORMATION FOR SEQ ID NO: 149: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 860 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 149: 

TAGGAGTTTG TTTTCTAGAA nTTGTTTTTC AATAAAGGGC CGGTTCTAAG TCACGCTCTC 60 

CCTTCTGCCG ATAGAGGGnG CGGAGGTGGG GATGCTATGG TGATGGCCGC GGCGCCTTTA 120 

CAGAATTTGT GTTACGACGG ACTTTTACAG GTAACCTCAG GGGGCAATAT CTCGCTCCCT 180 

GtACCCTCGA ATCAGGTGAT ATACGCTCAT TTCGAACATG TTGATGCGAC TCCTGCGGAg 240 
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cGGAGrCAGG CAGGGGTGTC GGTGTCTGAG OTGCAGATTT TGGACGCGTT GGTCGAGCGG 300 

CTGATAGTGC AGCGTCGGGT AGCGGCAGAA GCGGCAGACA TGGCGGTGCA GAAGCGGCAG 3 60 

GAGACACTGC TCCGCGCCGC AGAGCTTTTT TCTCAGAAGC AAGTGGACGA GACCAAACGG 420 

CGGGGAGAGT CTCTTCCTTA CACCTCAGTA GAAGTACAGG GGCCTGAGCT TTTTGACTTG 480 

CGCGCGTAGG GCACGTGCAG GCAGAAGTAC TTGACTTTTT TGAGGATAGG GGGGAGACCC 540 

CGCCTGGCCG GAG TTCTGAG GCTGGGTATG GACTCTTTTG GAGAGTGGGG GTTGTGTGGG 600 

TATGAGAAGA TTGCTGGCAT GTTCGGCGGG GGTGCTGTGT TTTTCCCAGC TTGGCGCGCT 660 

TGAGTTGTTT CTTTCTCCTA AGATTGGGAT CACGAGTGTG TATCAGTTTG GGAGTAACGG 720 

TGGTTCGGAC GGTACgTCGT CGGGTAAGGG TGTGTCTTTT GATAGACTGA TTGGAAGGGT 780 

TGACCTGGGG CTAATTTTGG TGAACGGCCT AACGATTTCA GCTTCGGCGG AAAGTTCGTT 840 

GACCAATGTC TTTGTGCGTG 860 
(2) INFORMATION FOR SEQ ID NO: 150: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13811 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : doubl e 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 150: 

TAAAAAAGGT ATGGATGACT TGTGACTAGG AGTGAAAGGC TAAACAAACC TGGAGATAGC 60 

TGGTTCTCCC CGAAATGCCT TTAGGGACAG C C TT AT AC AA AACTGTCGGA GGTAAAGCAC 120 

TGGATGGGCT AGGGGGTTTC ATCGCCTACC AAAC CCAATC AAACTCTGAA TGCCGGCAGT 180 

CAACGTGTGG GAGTGAGACT GCGTGCGACA AGGTTCGTAG TCGAGAGGGA AACAGCCCAG 240 

ACCGTCAGCT AAGGTCCCGA AATACCGCTT GAG TG TG AAA TGAAGTGTGG GTACCTGGAC 300 

AGCCAGGAGG TTGGCTTAGA AGCAGCCATT CCTTGAAAGA GTGCGTAATA GCTCACTGGT 360 

CGAGTACGCA TGCGCAGATA ATGTATCGGG GCTAAGCGGT ATACCGAAGC TACGGGTCTT 420 

GCATTTTTGG TGCAAGGCGG TAGGGGAGCA TTCCATGTAC TGATGAAGGA ATATCCGGGA 480 

GGAGTTCTGG AGGGGATGGA AGAGAGAATG CAGGTATAAG TACACGAAAA GGAGGGTGAG 540 

ATTCCTTCCC GCCGAAAACC TAAGGTTTCC TGGGTGAAGG TCATCTGCTC AGGGTAAGTC 600 

GGCCCCTAAG GCGAGGACGA GGGTCGTAGT CGATGGGAAT CCGGTTTATA TTCCGGAACC 660 

TCTTGCAATT TCGATGGCAG GACGCGTGAG GTGAAGCCCG GCCAAAGATT GGTAGTTTTG 720 
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GTCTAAGTAT CCGAGCCGTT TTAAGAGCGA TAGGCAAATC CGTCGTTCGA GGTAAGGTGC 
GAGTGCGACT GGAGCGATGA GCGAAGGGAA GCAGGTGTAG TCATGGCGAC GGGAAATACT 
GTCTAAGGTT AGGTTGCAAG AGACCGTACC GCAAAC CGAC ACAGGTAGGT AGGATGAGTA 
ATCTAAGGCG C TC G AG AG AA CTCGCGTCAA GGAACTCGGC AAAATACACA CGTAACCTCG 
GGAGAAGTGT GACCCTTGCC TTTGGTGAGG GTGGCAGAAA GCAGGTCCAG GCGACTGTTT 
ATCAAAAACA TAGCCATCTG CAAATCAGTA ATGAGACGTA TAGGTGGTGA CACCTGCCCG 
GTGCTGGAAG GTTAAGAGGA GAGGTTCGTG GTAACACAAC GCTTTGAATT GAAGCCCCAG 
TAAACGGCGG CCGTAACTAT AACGGTCCTA AGGTAGCGAA ATTCCTTGTC GGGTAAGTTC 
CGACCCGCAC GAATGGTGTA ACGACTCTGG ACACTGTCTC GACGCGAGAC TCGGTGAAAT 
TTATGTACCG GTAAAGAAGC CGGTTACCCA TAGTTAGACG GAAAGACCCC GTGAACCTTC 
ACCGTAGCTT ACTATTGGAA CTTGGTTTAC CATGTGTAGT ATAGGTGGGA GACAGAGAAG 
CTTGGCCGTC AGgTTAGGCG GAGTCAACAG TGAAATACCA CCCTTGGTAC GTCAGGTTTC 
TAACCTTTGG CCGTGGATCC GGCAAAGGGA CCGTGGTAGG TGGGCGGTTT GACTGGGGCG 
GTCGCCTCCT AAAAGGTAAC GGAGGTGCGC GAAGGTCTCC TCACACCGGT TGGAAATCGG 
TGCGCGAGTG TAAAGGCACA AGGAGGCTTA ACTGCGAGAC CGACAGGTCG AGCAGATACG 
AAAGTAGGTC TTAGTGATCT GGCGGTAGCG TGTGGAAGCG CCGTCACTTA ACGGATAAAA 
GGTACTCCGG GGATAACAGG CTGATTTTCC CCAAGAGTTC ACATCGACGG GAAAGTTTGG 
CACCTCGATG TCGGCTCATC GCATCCTGGG GCTGAAGCAG GTCCCAAGGG TTTGGCTGTT 
CGCCAATTAA AGCGGTACGT GAGCTGGGTT CAGAACGTCG CGAGACAGTT CGGTCCCTAT 
CTGCTATGGG CGTTGGATAT GTGAGAGGAG CTGCTTTTAG TACGAGAGGA CCGAAGTGGA 
CGAACCTCTG GTGTACCAGT TATCCTGCCA AGGgTACGTG CTGGGTAGCT ATGTTCGGAA 
GGGATAACCG CTGAAGGCAT CTAAGTGGGA AGCCCGCCTC AAGATTACAT ATCCCTGAAG 
GTTGACCTTC CTGAAGACTC CTTGCACACT ACAAGGTCGA TAGGCTGGAG GTCTACGTAC 
CGTAAGGTAT TAAGCCGACC AGTACTAATA AGTCGTGAGG CTTGACCATA TTATTCATCC 
TTCTCCTTCA CCCTACCCCT TTGCGTAAAA TATTTCGCCT GGTTGCCATG GTGGAGAGGT 
CATACCCGTT CCCATCCCGA ACACGGAAGT CAAGCTCTCC TACGCCGATG ATACTGCTCC 
TTCGCGGGAA AGTAGGTAGT AGCCAGGCTC CCCTTTGCCC ATCACTCGCT TGACAAAATA 
TTATCCGAAA CCTTAAAGTC CGCAG t GGCT TCGGCGCATT CGTTTGACCT GGTAGATTTG 
GATGATGAGG ATTACGGTAC TATCCTTGCT GAGGATGTTG AGTTTTCAGG GACGGTTCGC 
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TTTGAAGAGC CCTTCATGGT AC GTGGTGTT TTTGAAGGGA GCATCGAGTC CTCGGGTGAT 2520 

. CTTATGGTTG AGGAGCAGGC GCGTGTTCGC GC TGAGAT AG TTGCAGATCG TGTTGTCATT 2580 

AAGGGAGAAG TAATCGGGAA CGTCACGGCT ATGTCTGTTG TTCGCGTCTT CCCATGTGGG 2640 

AGGTTGATAG GGGATGTTAC CGCGCCAGAA GTTGTGCTTG AGAGTGGGTG CTTTTTCAGT 2700 

GGCGTGTGTA GTATGCCCGA ATCGGGTTGA TCGGTTTCCC TTGCAGACGG GTAGCTCTCT 2760 

TGGTCGCTAG GTTTCTTTTG CCCGTTGTGT TCTGTTTCGT GTGTAGGGGG CTCTGTTCGC 2820 

AGCAGTCTGG TCTTGCTTTC TACCGGGCGG GTAGTTACCG TGAGGCGATT AGATGCTGCG 2880 

AGGAAGCTAT CGCACAGGGG TCTAAGGACC TTGAGCATTA TCTCGTGCTT GCCTGGGCGC 2940 

TTGTTGCGGT TGGGAAGTAC CATGAGGCTG CCCAGTGGGC CCTCGAGGGG CGCGCCGTTG 3000 

CCGCCCATGA TCCTCGCCTC ATAGAGGCGC AGGCGGAGGC TTTCTGCCAC TTGGGGAGAA 3060 

ATGAAGAAGC GCTTAGGTTG TTTCAAGACT ACATTGCGTG TGCCCCGAAT GGAgCgcGGC 3120 

AGACCGCCGC GTATTATCTT ATGGGGGAGA TTTACCTGCG CACTTCGCGC TTCTTGCACG 3180 

CAGATATCGC TTTTTCCGTC GCGCTCCAGT TAGATTCGCT TAACGATTTC TGGTGGGCCC 3240 

GCTTGGGCTA TGCaCgcGAG CGCGCGGGGG ACTATCGCTA TGCGTTACAG GCCTACGACC 3 300 

GAGCGCTCCA GCTGAATAGA GATCTGGCTG ATGCTCGCCG GGGCAGGGAG CGGGTGCTGA 33 60 

GGTACGTGTT CCGTAGGTAG GTTTTCTTCG TGTTTTCCGT TCGCATCGAG ACGCTTGGCT 3420 

GTCGTCTGAA TCATGTCGAG TCTGAGTCGC TGGCAGCGCT CTTCTTGCAG GAGGGCTTTG 3480 

CGGTATGCCG TGGCAATACC TCCACTGCCC CAGTGGTTCT GTGTGTGATC AACACCTGCA 3 540 

CGGTCACAAG TAAGGCAGAG CAAAAGGCGC GGCGCCTCGT TCGTCTCCTG TTGCGCACAT 3600 

ATCCTACTGC AATAGCGCTT GTCACTGGAT GTTATGCGCA gcTTGAGCCT GCTTCTCTTG 3 660 

AAGCCATGGA TGATCGTGTC CTTGCTTTTC CAGGAAAACA AAAAGATGCC CTCAGCCTTC 3720 

TGCCCTCGTG TCTCCGTGCG TTACTTGTGC AGCGTGGTCC TGCGCCGATA GATCAGTATG 3780 

TATGCGGTAT GCGTGCGCTG CTCGCTTCTT TGAAGAAAAA AATTATTTCT TTGGAACTAA 3840 

CGTCTGAGTT TCCATCGCAG ACGCATATGC CCACAAGGAA TGCCCTTCCT CAGTTAACCG 3900 

GTGTGCCTCA TGCGCCGCGC GTTTCTGTAT CTTCGTTTTC AGAACCTACA GCCGTTCCCC 3960 

GTTTTGCTTT GTATGCCCCT CGTTTTTTGT TCCACTCACG CGCAAGCATT AAGGTGCAGG 4020 

ACGGTTGTAA CAGTGGATGC GCTTTTTGTC GCATTCGTTT TGCACGCGGT CGCGCTGTAT 4080 

CACTTGAAAC ACACGAGGTA ATTGGGCGGG TGCAGGCCCT TGAAGCTCGT GGTATGAGCG 4140 

AGGTTGTACT TACAGGGGTG AACTTGTCTC AGTATAGAAG TGGCAGTATA GATTTTGCGG 4200 
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GTTTATTAGA GTTGATTGTG CAGGAAACGC ATACGATTCA TATTAGAATT TCGAGTTTGT 4260 

ACCCAGAAAG CGTAACATCT GCTTTTTTGC GTGCTATTGC GCACACGCGC GTGTCGCCTC 4320 

ATTTTCATTT ATCGGTTCAG TCGGGCAGTG ATCGCGTGTT ACGACGCATG CGACGCGCTT 4380 

ACACACGTGC GGACATTTAT CAGGCAGTTT CCGATTTACG GAGTGTGCGT GAAGAACCCT 4440 

TTTTGGGTTG TGACATAATC GTCGGCTTTC CAGGGGAAAC AGAGGAAGAT TTTGCAGACA 4500 

CCCAGCGTAT GTGCAAAACT TTGCGTTTTG CAGGTATTCA TGTATTTCCG TTTTCTGCAC 4560 

GCCCCGGTAC AGAGGCGTTT GCTATGGATG CAAAAGTGCC TCAGCGTATT GCAGGAGAAC 4620 

GCGTTGCTGC AATGCAgCAA CTGGcAGAGA AAAACTACCG TGCGTATTTG GAATATTGGA 4680 

ATGGGAGGGA ACTATGTGCG GTGGTAGAAC AGTCCGTCGC ACGTGTTTTG ACAGAAAATT 4740 

ATTTGAGCCT CCCAATCATT GAACGGGGTG GCGTCGCTGC CTCAGCAGGA TCACACGTAA 4800 

GGATTAGAGT TCATAACGAG GGTGCTATCC TCTTGTGAGT TTCTGATTCG CAGTGGAGAT 4860 

TCATTGTTGT TAGAAAAAAA CGAGCATGCA GTAGAGCGTG AGGAAGGGTC TGTGCGCGTG 4920 

TTTCCTGTCT GTCCGCTGCT GCGTGAAGTG AGTGAGCGTT TCATCCGTGC AGGATTTTGC 4980 

ATTTATGTTG TCGGGGGTGC TGTGCGTGAC TTTCTCCTGA ACCGCAGTGC TCATGATTGG 5040 

G ATCTTGC AA CTGACGCTCC CCCTGAGCGT GTGCGTATGC TTTTTAGACG CACGGTTCCT 5100 

ACTGGTATTG AACACGGCAC TGTTACAATT CTTTTCAGAG GGCATTCTAT TGAGTnGCAC 5160 

TACGTTCCGT GTTGAGTCGG ATTATTCCGA TAGGAGACAT CCGGATTCTG TTTGCTTTGC 5220 

CGCGCGCATT GAGGACGATT TGGCAAGGCG CGACTTCACT GTCAATGCTT TCgCTGCCGC 5280 

GCTCCCCTCG GGGGAAATCA TCGACGTATG TGGCGGTTAC GCcGATTTGC GTAACGGTCT 5340 

TATCTGTAGC GTTGGGGATG CACATGCTAG ATTTTCTGAA GATGCGTTGC GTCCTTTGCG 5400 

TGCTGTGCGT TTTGCAGCGC AGTTATCTTT TTCCATCGAA GCGCGCACGC GTGAAGCAAT 5460 

TtnAckcTAC GCGCTCaTAC TGCACGTATT TCTCGTGAGC GGGTGCGTGA TGAACTTTCT 5520 

AAGATGCTTT GTACTCCCCG TCCGAGTATT GCCTCCGCCT AATGGAAGAG ACTGGATTGC 5580 

TG C AC AC ACT TTTTCCTGCC TGGCGCAGGT GTGTGGGAGA AACGAAGGGG AGGAGAAGGA 5640 

GACGCAGGAC AGTCTGCAGG TTTCACCGCA GCAACGACGT GACGCTCGCA CCTTTGCTGC 5700 

GTGCGCGTTT GCTGCGTGCG ATCGGGTGCC TGCAGAGCTT GCGGTCCGCC TTGCAGCACT 5760 

TCTCTTTCCG CTGGCTCACT ATCGTACGCT TCCTGCTTCA GGAACGGGGG CAGCGTTGTT 5820 

TATATGCCCT CCTGCACTTG CTGAAGCGCG CGAGCTTTTG CGTGGCCTGA AGTACC CGAA 5880 

TTACc TGACG GCTCAGGTGT GTCACTTGGT TGCACACGCG CGTTTTACTC CACACGAATG 5940 
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TTGGTCTGAA GGAACGCTCC GTCGTTTCGT TGTGACGGTA GGTACTACTC AGCTAGAATC 
GCTGTGTGTC CTGCGGCGTG CCTATCTTGC TGATGACGAT TACCGCATAT CTGCTCAGCC 
TGGAGTGTCC TGCGAGAATG CGCGGGAAGG AAAAAAAATG CAGGAACAGT TTGAAATATT 
CGTAACACGC GTGCGCAGGG TCGCCGCACA ATTGCCTGTG CACGGCATAC GCGAC cTTGC 
GGTGAACGGA CGCGACTTGA TAGCGCAGGg CATTCCCCCA GGTCCCACCA TAGGGCACAT 
CCTGAACGCA CTGTTTGATA TGGTCCTCAT GCAACCGTCG CGCAACACGC GTCCGCAGCT 
TTTGGAACAT GCGCAGGAGA TTGTGCGCAC CATGGCGCAG AATTAGTGCG CCTTGAGCAA 
CCCGTAGAGT CTTTCTAATT CTGCTTGGGA GAAATACGTG ATCTCAATTC TCCCCCGTTG 
TAAATTCCCA CTGATGCGCA CCTTGGTTCC CAGTTGTTCC AACAATTGCT GTTCAATGTC 
TGCGATGTCT GCGTTACGTA TGCGCGCATC TAATTGCGCA TCCGTCGATG GGGAAGAAGG 
AGGAAGGCGG GTGATGTCTG TAGCAGAACC GCCGGGTGAA GGGGAAGACG CCGCCGTGTG 
CGCGCGAGCT CCAGCGTAAT CGTGCAAACT TCCGCCACGA TTAAGACATG CAACGCACTC 
CTCTGCGGCA CGTACAGACA ACGCATGGGT AACTACATAC TGAGCAACGC TTACACACAA 
CTGCATGTCG GTGAGTGACA GAAGTGCGCG CGCATGCCCT GCGCTCAGCG TTCGAGACGA 
AAGCGACTGC TGAACTTCAG GAGGCAGTTT TAAAAGACGC AGCGCATTGG TAATGGTACT 
GCGGTTTTTT CCAACCCGCT GTGCGAGCTC TTCATGACTT AAATTACCCA GATCCATGAT 
ATGTTGATAG GCGCGCGCCT CTTCCAGGGG ATTCAGGTTT TCTCGCTGAA CATTTTCAAT 
GAGCGCGATG GCAAGCTTTT TTTCATGATC GCAGGTGCGC ACGATAACAG GTATCCGATT 
CAACCCCGCA AGGATGGCAG CGCGTGTTCT ACGCTCCCCC GCGATAATGA CCCAGCTTCC 
GTCCTGGTTT TTTTCCGCAA GGACTGGCTG GATTACCCCG TGCTCACGAA TAGACGCGGC 
AAGCTCCTCG AGAGATTCCT GCGCAAAGGT GCGACGCGCC TGATGTGGAT TCGCCTGCAG 
CAGCGTGGGA TCAAGATAGT GTACAGTCTG CACACCGCCT GAATCACGAA CATCGTATCG 
ATCTGAGCTT TCTTGCAGCA GCGCGTCAAT GCCTTTGCCT AATTTATCTT TGCCCATCGC 
GTGCCACTAT TTCCCGTGCA AGCTTCTCAT AACTCCGTGC CCCTGCGCAC TGTGCATCGT 
AAGAACTAAT GGGTAACCCG TGCGAAGGGG CTTCGGATAA CTTTACGTTG CGAGGAATGA 
TAGTATTGAA CACCTTGTCC CCAAAATAGG TTGTCACTTG CTTAAC C ACT TCTTGCGCCA 
gCTTAGTTCT AGTATCATAC ATGGTAAAGA AGATGCCTCC GATCGAAAGC GCGGTATTCA 
GACCACTTTG TACACGCTTT ACCGTCTGCA GTAAGAGTGT GAGGCCTTCA AGTGCAAAAT 
ACTCACACTG CAATGGGATG AACACCTCGT TTGCCGCTGC AAGTCCATTC AATGTGAGGA 
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TACCCAGCGA GGGCGGACAA TCAATCAGGA TAAAGTCGTA CGTGTCTTTT AC TTCTGCC A 
ATATCTTTTT AAGGTAGAGC TCGCGGTCTT GTTCATCTAC TAGTTCCACC GTCGCGCCAG 
AAAGATCGAT GGAAGCGGGG ATAGCAAAAA GGTTATGCAC TGGTGTaGTG CGCAcGCtGt 
TGATGTGTGC CTTACCTGCA AGAAGATCAT ACACGGTCAA CCCTCTAGCT AAGCCGAGAC 
CCGAGGACAT GTTCCCTTGA GGATCAAAAT CAACGAGCAA GGTTTTCTTT CCTGCAAGCG 
CTAAATACGC ACCCAAGTTA ATGGCAGAGG TTGTCTTACC CACCCCTCCC TTTTGATTTA 
CGAACACCAA GGTTTTACCC ATGGTGCCGG AGTGTACTCC ACTTTCTAAA TAAATAAAAG 
GACTTCGAAA GCGCGTCTGA ATGGGAAGCT ATGCGATGGG GTAAATCCTG GTAAGGAGAT 
ACGGTGTGCA AAAGCTAGAG TGATACGATG AACTGTGCTA CCTTACGCGG AGGTGATCCT 
TTTTCTTATA AAAAAACATT GCATTGATTG TGGTGAAGCG TTTTTCGGTA TTGTAGTGCT 
CAATGCGTTG TGTCTTGCAG GGGTAGGGTA CAGTCTGTTA TGGCACCAAG GGCCTGGTCG 
GAGCGTGTTG TTTGTCTTAG TACTGGCTAC GCTGTACGCA TGTCTGTGCG CGTTCTGTGT 
CGTTCGCGGA GAACGGGGAT GTGATACTCT GGCAGATACG AATCTACGCG TCTTCACACA 
CGCACTGCGC GAGGTGTGGC TCCAGAGCCT GTGGTGTGCA CTGCTACAGT GTGTGTTGTT 
TCGAACTGGA AAATACGTGT GTACATATTA CTTTGCACGC ACGCATTCTG TATTTACCGC 
G TGC G g TATA CTCAGTGCCT GGACATACGC GCTCGCgTGC GGTGCACTCC TGTGGTTTGT 
GCCGGTGCGT GCACGCTACA GAACGCATTT TCGCCAATGC GTATATCTCT CGGCACGAGT 
ATTTTTTGAG CACCCGTGTA TTACCTTCCT TATGGTTTTG TACAGCATGG GTGTGCTCGC 
GCTGAGTGTA CCAATGGCTT TTTTATTTCC AGGACCGTGC GGCATTGTGC TTCTGTGGCA 
GGaTGtgCTG CGCACGCTCT GTTTTCGTCG TGCATGGCTT GCTGCACACG AGGGGCGCAA 
GGCGGCGTGC GCACCGCCTA TTCCCTGGGA GCAGCTCATG TGC CAGATGC GTGCACAATC 
CCGCGCGCAC ACCGTCGGCG AGCTCTTCTC TCCGTGGAAA TCGTAATGTT CATTGTGTTC 
GAGAGAAATA ACAACCCCGC AGTCTATGAG GGGGTGCCGT ACTCAGCAGG ATTTTGGTAT 
ATGTGCTCAA GCGCGATCTG TGCTTGTGCA AACAGCATGA TACTTTCTGT CGCAGTCTGA 
TAACGCTTCT CAGCAACGAC ACGCACAATG CCGCAGTTAA TGATGAGCAT TGCACAGGTG 
CGGCACGGTG TCATGGTACA GTAGAGTGTT GCGCCCTCTA GACCGATGCC CAAACGCGCT 
GCCTGGCAGA GGGCGTTTTG CTCTGCGTGC ACGGTGCGAA CGCAATGCTG CGTGCACGTC 
CCGTCTTCAT GCTGCACCGT GCGTAGCTGG TGTCCATGTT CGTCACAATG cAGGAGACCT 
CGCGGTGCGC CTGCATACCC AGTTACAAGC AGATGGTGAT CGCGCGCTAT GACGCAGCCT 



7740 

7800 

7860 

7920 

7980 

8040 

8100 

8160 

8220 

8280 

8340 

8400 

8460 

8520 

8580 

8640 

8700 

8760 

8820 

8880 

8940 

9000 

9060 

9120 

9180 

9240 

9300 

9360 

9420 



Printed from Mimosa 02/03/22 07:30:40 Page: 808 



WO 98/59034 



807 




GCGCGTCCGC GATCGCAGGT GGCACGCTTG GCAATTGCAT GACACACTTC CATAAAATAC 
TCATCCCAAC TCGGTCGCGT AACCAACTCC TCACGTTGTC CCATTaTCGT ATGCTCCCTT 
TTTCCCTTAA AGATGC AC AT GcCCTTGTGC TGCGGTG CGC AGCACACTCT GGCGGGTGAG 
TTTAAACATC GCTcaCTTCG TGCGGAATAT ACGGTAGAAA AGCAAGAGAA AGATATACCA 
TACGTATCGC ACGCTCC TGT GCCAGCGCAA TGGATGCAGT GTTTGTCCCC AGGacTGCGT 
CCGTGTAACA GTTCTTTTGC GCGTGGTCGT TTGCGCCGTT TTGAGAAAAT GTCGATAACC 
TCGCTGTCTT GCACAAAAAT GCCACCGGCG AAGTGTGcGC GATTGCGCGC AATCCATCGT 
ACGCCTCCTG GTACCCCATT GCCGACAATT AC C AGCGATG CGTTGGATTT • TACAATTGCC 
GTGATAATGT TGTTTTCTAA CGTCTTACGA TAAAAGCCGT GAAATCGACC TACCACGCTC 
AGTCCGGGAA AGG T AG AATG CACGtTACGC TCCGCCTCGA GCAGACTCTG TCCCCGACCG 
CCGAACAAGT AGAGTGATTT GTAGTGCGTG TCCATAACGT TGAGCAAGGT GATGATGAAT 
TGAAACGGGT GCCGAAAGAT AGGTGTCGGC AATCCCAAAA AGCGTGCACC GCGCACAATA 
CTGTAGGATG TGGGCAGACA CAACGCTGCG cGGCCTACCA TGGTGCGGAA CTCGTGATTA 
TGCCGTGCCT TTAGCACGTC CCAGAGAGAA AGAAATATAA TGTGCTGCGG CTCTTTTTTT 
TCTAACAGGG AGAGAATGAC GGTGCACAGA TACTCTTCAC GCACCACATC AAGGGGTACG 
GAAAGGAACT CAATGCGCGT TACCAGCTGC GTTTGTTCCA TAGTGCTGCC TCACTAAGAA 
GGAGATGTGC GCCTGCAAGA GCATAGAGCG CGGCCGTTTC TACGCGwCAs ACATTCGTGC 
GAAAGTGCAG GGGTGCAAAG TGCGCGTCAT GAAGCGTGCG CTCTTCTGTA TCAGAAAAAC 
CCCTTCAGCG CCTACTGCAA GTATAACCCC TGCGCGCGCA CATCCGGCGT GTTGTGCGGC 
GCATTTGGCT CTGTGGGCGT ACGGTGCGCG GTGCAGTGTG TAGAACGTCC TGTCCAAAGC 
AGTTCGTGAA AGGAACCTCC CACGTGTTTT TCACTGcAAT AGATACGCAC TGCAGAAGGA 
TAAGGAAATT CCTGCTCGAG CATATGCAGT GCGCCAAGGA GGCTGTGGAG CGTGTCCACA 
CGGGTGTTCA CGGcAGAACC AGACTGTTGA CGCGCTTCCC GCACGATGCG CTGCAAGyGt 
GCATGCTGGG TGTCTTTTTC ATCGCGGATG AGCGAAC GTT CTCCTCGAAT CGGTATGATG 
CGCGCAGTTC CAAGTTCTGT CGCATGCCGA ACGAtCGCGT CAAAGACGnG CGCACGGGGC 
ATCCACTGAA GCAACACAAG CGTTACCTGC GTTTTCTCAT CCACTGGGGG CATGGGCGTG 
CAGTGGAGGA TTAGTGAACG CGTATGTATG TCGAGAGCGG ATACTACTGC GCTGTACTGC 
GCACCGGCCT GCGTACGCAG GGTGAATGTA TCCCCcAgCG TGCGCGTcGA ACACGCACCA 
TGTAATGAAA ATCCTTGCCT GAAAGCTGCA CATACCCGCC GGAATCCGGT TCTGCGGTTA 
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TCAGCAATTG TTTCATAATA TCnCaCGgCT 
CCGTAAGGAA CTTCTCTCAC CGTAAACCCT 
CACGCACAGT CTGGCTCCCA TAGCCTGGGC 
ACCAAGTCTT TGCAAGGGGT ATAACAAGGG 
CTTCTTTTTT TCCTGTATTC CCTCTTGCAA 
CGCkTGaATA TCCTGTGTGC GTATCAACTC 
TAAGTCATAT ACAAAAGACG CATTGTGCCG 
CAGTATTGAT CGGAGTTTAA ATTCGCGTGC 
GGAGTTCATG GTACGGTTCT GCTCTGCAAA 
GACAAGTGCC TCTTCTTCCT CGGGAGTAAA 
AATGCCGCtC TTATCGATAT TTGCGTCGCT 
CAAACTCTCC CGTTCATTGA GGTCAAAGAC 
TTGGCCGACC AGATATGCGC GCTTGTGGTC 
GGACGCAGAA TCCCTGTTGA TGAGAACAAT 
ACGTGCGTTA ACGCTGAACG TTATGGAGTG 
GGTTCCGGAG GGAATGAATG AGCTGGTAAC 
ATTGTTGCGC AGg TCGAGAA TGAGACGGTC 
TTCTACCATG CGGGTAGCGG TGACTGGGTT 
TGGATCGATT TTTGCGTACT TAACAGTGGG 
TGGTCGGAAA ATCTTTGTAC CCCGTTTGAC 
CCGTAGTTTC TTTAAAACCT CGTCCATAGT 
GACGATGAGG TCCCTCGGTC TGATTCCTCC 
GGTCACTTCC ACGTAGGCGG GTTTGCCCGG 
GCCGATGCCG CCAAAGAAAC CCTTGGTAGT 
GCTTTCGACG AAGGTTGTGT AAGGGTCCTG 
CTTGTAGAGG ATGTGGGGGT CTACCTCGTC 
TCCTGCACCG TCTGCATGCG CTCGTCTTCC 
CATGTGGGGA AGGGAGCTGC GCTGGTGATC 
AGAACACTCA TGGGCGGATG ATGGAATGCG 



808 




13041 



GAGcGCGCCG GACGCTGTGC GACAACACCA 11220 

ATACATATCT GAAAgTAGTG CGGCGCCGAG 11280 

AGCGGGACCG ACCTGATAAG GgCCACACTC 11340 

CGCAgCGTAc rAACTGGACG TTCAAGATAC 11400 

TTCTTTGACC GTTTTAGTAC GCGCCACCAG 11460 

CACCGCCTTT CGCAGCTGTT CATCATATTC 11520 

GTGATACTCC TGTGCGATGA GCTTTTCCAC 11580 

CAGGCCACGT GCATACTGTT TTATGCGTTC 11640 

GAGTTTCACT TTTTCAGACT TAAATAGACT 11700 

CTCTGTTTCT CGTACCTCGA GGTCCGGCGG 117 60 

TGGGGTGTAG TAGCGTGAGA TAGTCATTTT 11820 

TTGCTGCACG ACTCCCTTTC CGTATGTGGT 11880 

TTTTAGCGCA CCTGAAACGA TTTCTGAAGC 11940 

CACAGGCATA GACGGAGGTA GCTTCTGCGC 12000 

TCCCTGCACG CGTGACTTAG TGGTTACAAC 12060 

GTCCACTGCG GCGGTGATAA GGCCGCCTGG 12120 

GCAACCTTGC CTACGCAkTT CGGTAACGGT 12180 

GAACTCTATG AGGCGCACGT ACGCGATGTT 12240 

AATTTCAATT TTCGCACGGG TGAGGGTGAA 12300 

AACAAGAGTC ACCTTGGTGC CTATCGGCCC 123 60 

CATGGTGTCA GTGCTCATGG TGCCGATTTT 12420 

TTTCCAGGAA GGAGTGTCTT CAATGGCGGC 12480 

AGTGGATGTT CTGGACTTAG AAATGACAAC 12540 

GTCTTGGAGA TTAACGCCGC TGATGCTGTC 12600 

AAGAGAGTTA ATCATGCCTT CTAGTGCCCC 12660 

GACGTAtATT TGCGGAGGAA TTCGTAGACA 12720 

TCGGAAGACT GTGGCAAATA AGCGGCAGTC 12780 

AGACAACAGA CGAGGAGCGC AAACAGTCTC 12840 

TGCCCGTCTG TGTCAAGAAT CTGCGTTGGA 12900 
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CAGTTTCCTT TCCATGCGAC TAGACGGAGG AGGGCACTGG 
AGACCCGCTC TGGGAGGAGC AGGATATAGC GTGACTTCCG 
TTCTTCGTTT TTCACCCCTG TGCATGGGGG GGGGGAGAGA 
TCGCACCCGT TGCGTTGCGG TGCGTGCAAG GACTGTGCTA 
CGTGATCATC GCTCTTGATG GACCTGCAGG CTCTGGGAAG 
CGCGTCTCGC CTTGGCGCGC AATGTTTGAA CACGGGTTCT 
CGcCGCATTG CGTAGGGTAT CGGAGTTGGC CGTGCAAGCG 
TGATGCGGCG GTCGGGTGCG CGGCTGTTCC ACACGCAACA 
TCCTCTGACG GCCCAGAAGA AGGTTGCACT TTTTGATGAA 
GCGCACAGTT GCGCTTTCTT ATCGTGCGGG TGTGATGTAC 
GTCACTGCTG CGTTCGGATG AGGTGGAGTC GGCAGTCTCG 
TATTCGGGCA ATTATGACGG GGAAGATCCG GTCGGCCGTT 
TGAAGGGCGT GATCTAACGA CGGTTGTGTT TGTGGATGCG 
CGCTTCTATT GAGGCGCtGT GGCGCGTCGT TGGGCGCaGG 
CAGGAACTCG AGCAGGCGCA TGCGCGCGAC GTGACGCACA 
GGGGGGCTCA G 

(2) INFORMATION FOR SEQ ID NO: 151: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1233 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
<D> TOPOLOGY: linear 



GGGAGGTGCA GGCGCGCCCC 12960 

TTTTCTGCCC TGTACTTCCC 13020 

GAGAGAAGTT TCGTTCATCC 13080 

CAGTGCCGGC CGATGGGGAC 13140 

AGCAGCGTCT GTCGTCTGCT 13200 

TTCTACCGTG CATTTACCCT 13260 
TGCTCTCCTT CTCCGGACCC - 13320 

AATCTGGACA CATCATATGC 13380 

GCGTATTGGG TTTCGTTTGC 13440 

GTGGGCGAAG AGAACGTGGA 13500 

TACTTCGCGG CAATGCCGGC 13560 

TGTGGTGCGC GGGTAGTTTG 13620 

GACTTGAAGT GCTACCTTGA 13680 

GAACGAGCCG GTTATCGAAg 13740 

CGACAGCGAC GCnCACCGTG 13800 

13811 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 151: 

TTTC CATACG CTCGCTCCAC ACCTTTGCTT TCTTTTTTAC CTCCTCTGCA TCTTGTCGCT 60 

CGTTGAGCAA CAGGTGGACC CTGCGCAGCG CGTGCAGGTA GCAGACAGCC AATTTAAAAT 120 

CTTCAAAACT GGAGGACGTC ATCTCATACT TTTCCCGGTA ACGCTCAGCT GCCTGCAGGA 180 

GTAATTTCTT TACCAAACGC AGGTGATAGA CCACTGTCTC GTACTCGTCA GCCTGTGGAT 240 

TCAATCCAAT CTCAGAAGAA TTCGCTAAAT CCAGAATATT CTTCGCCACC GTTGCGTAgc 300 

GCGCCTGGAG CTCTACAAAA GACCAATACC ATTTAGTATT CGTCCCATAG GCGCTAATAA 360 

TGGAATCAAT GGCGAGTCCC ATCTTACGGA TAAGATAATA ACGCTGCTGC TGACTTACGT 420 
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TCGCTATCTG CGCCACCTGT TCCTGGTAAT CGGAAAACGG CGTGTCAATT AAATTGGTAA 
CAATTTCTTC AAGATAGATG AGCGCCTTAT AAAGCGTTTT CCGCGCCTCA TTCAGCAGAT 
CCTCCTGCTT ACCACCTACT ACTACAACCT GTGTCTGATA TTTAGCAGCG TACAGCGTAC 
TGAGGTAAAT CATGTCATCT ACCAACGCGA GCTTCTTGTA CGCCGCGCCC GTCCCATCCC 
GCCTGATCAG TTCTAAAATG TTTTTTTCAC GCGTAAAGAT TTGATCAATA GTCCCTTGGT 
ATACGTTCAA CTTTTTCTGG TACAGAGCAT TCTGCTCTTC CTCAGTCACC GCACTCCCCC 
CTTCTCTGCC GCGCACCTGC GGCAACAACT CCTGCGCATG CGCTAGGGAG GTTGCACTGT 
GCGTGTCCCC TGCTAGCATG CGCGCAACTT CTTGAACCCG TCGTTCCCCC ACTACATGCG 
CCGCGCTCGT ATTCGTGTGT TCTCCACTCG ACTCTTTTTT CACACACACG TGCGCATCCG 
CGTGCGCCGC TATCATAGCC AAATGCGTAA TGCACACAAC CTGCTTGTGC TCAGACAACG 
CTTGCAAATG CTCTGCAACC GCACGCGcCG TTTCACCTCC AATTCCCACA TCAATCTCAT 
CAAAAATCAA CGTGCCCACT TCATCGACCG ATGAAAGCAC AGTCTTTAAA GCAAGCATCA 
CGCGGGAGAG TTCCCCCCCT GAAGCAATCT TTGCTAGCGG ACGCGCAGGC TCTCCTGCGT 
TGGCGCTAAT TAAAAACTCA ACGTCATCAA AGC 
(2) INFORMATION FOR SEQ ID NO: 152: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 946 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 152: 
GTAAAGCAAA ATCACTCAGG ACAAGACTCA CTTCTTACTT TCGCTGCAGG CACGACCCAA 
AGACACGCGT GCTGATGTCG CGCGCCGCTG CACTTGAATA TC TACAAAC A CAACACGAAT 
ACGAGGCGCT GCTCCTTGAG AACAC AC TC A TAAAAAAACA TACTCCGCGC TACAATATCT 
GCCTCAAAGA CGGGAAAACC TATCCTTTGC TCAAGCTAAC CTGCGAGCCA TTTCCGCGTA 
TTTTTCGCAC ACGCCAATTC TGTCAAGACG GTGCACGGTA CTTTGGTCCC TTCCcTGACG 
TGCAAATCCT CGATTCTTTT CTTAAACTCA TTTTACGCAC CTATAAAATC CGTACGTGCA 
CCACcTTGCG GAAGAGAAAA AATCCTTGCC TCTACTATCA CCTGAAGCGC TGCGATGCCC 
CGTGTTGTGG ATGGGTCTCT CCACGCACAT ATCAAAAGGA CATACATGAG ATTACCCTGC 
TGCTCGAGGG GAATATTGAC GCGACTGTAG CGCGTCTAGA AAAGCGCATG AAACGAGCAG 



480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1233 



60 
120 
180 
240 
300 
360 
420 
480 
540 
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TCCGCCAAGA AGCATTCGAA GCTGCCGCGC GCATACGCGA TGATATCCAG GCAATCCGCT 
GTATTACACA CAAAAGTCTT GTTCAAGACA TGGACGAACG TGCACGCGAT TACATCGCCT 
GGTCGAGCAC GGGAGCAATC GTCACCTTCG CCGTTCTACG CATGCGGGGA GGAAAATTAA 
ACGGTAGAGA ACTTTTTCGC ACACGTTCAT TAAAAAATGA AGAGGAAATC CTTTCAGAAT 
TTCTCATCAC TTAC TACTCT GACCATACCA TACCCCCACA TCTATTTGTA CACTCGTCTG 
CAGGGTTAGC AGAACACTGG CTCAGCCATA AAGCAGGTAC ACAATGTACC GTCACGCTCA 
TCCCTTTGCA TACCTTTCCT ACGCCGCAGA CCCCTTCTTC CACTGTCACC ACAAACGCTC 
CTACCCTTGC AGCTTCGCAA AATAGCAATG CAGTACAAGA TTCAGGGTTA CGTTCTTGCA 
GCGAAACGTC CACCATGCAT ACGCTTCAAA AAGCACACGA CGCCTGCACT GCAAGCGAAG 
GCACACGAGA AAACACACCG CACGAGAGCG CGCACACTCC TCATCACCGC GCCATTTTAG 
CCATGGCGCA GTTAAACGCT CATGAAGATA TTACTCGGTA TCTGAAAAAT CGCGGCGCTG 
ACGATGCACT CAAGGAATTG CAAAAGCAAC TGCATCTTGC ACGCATTCCA ACGCTCATTG 
AAGGATTTGA CATTTCCCAT TTGGGTGGAA AGTACACTGT CGCAAGTCTC ATTTGCTTCA 
AAAATGGGGC CCCCGACACA AAGAACTACC GATTGTTTAA TTTACGTGCG CACGACACCC 
GTaTTGACGA TTTTGCATCG ATGCGCGAAn AATTgCCCGC C G TT AT AC C C AC AC AC C AGA 
GGGCTACACT CTGCCCGATC TTATCCTTGT CGATGGGGGG AATCGGTCAC GTTTCTGCTG 
CACAGCACGT CCTCGACGCT CTTGGTCTTA GTATCCCGCT TGTAGGTCTT GCAAAACGCG 
CAGAAGAGCT ATTTATCCCC AATTCTCC TA CACCACTAGT TCTGGATCGT CGCAACCyTG 
CACTGCATAT GCTgCAACGC ATCCGAGATG AAGCACACCG cTTTGCAATC ACACGGAATC 
GGCATCTACG CACAAAGAAA GAGCTAGTCT TAAGCTTTGA GCGTCTCCCC CATGTGGGCA 
AAGTGCGCGC ACACAGACTG CTTGCTCACT TCGGTTCGTT CCGCAGCCTG CAGAGCGCAA 
CTCCCCAGGA CATAGCGACA GCCATTCATA TACCGCTCAC CCAAGCACAC ACCATCCTGC 
ACGCGGCAAC CCGCTCAACA ACCGCCCCTG TACGAGAAGA ATATAAAGAA CACGAGCACG 
ACCCCCAGGG AGAATCACCT GGACCAGGTC GGAAAACAGA CTAACGCGCA CCCGGCCTAC 
GACGACGCAT CCAGGAGTCG CTCAAGCTCA TTCTTTCCCA GAAGCTCGAG CGGACGATTT 
TCTGCAAACG AAAGCGCAGC GcgTGAAAAC GAAGAAGACG TCACCACCAC CCCACGCGCT 
ATACCACGCT CCTTCATGCG CTCCAAAATA GCGCGCAGGA AGCTGTCATC GAGCACCCGC 
GCTTCCCGGT AAAAAACAAC GACTTTTGGC TGCGTACGCA CGTTGCTCCA ACCCTTCTCG 
TCCCTTTCTG TGGCCACTAT TTCACAACAC CCGCGCACGT CCTGCACCGA GTCTATGGAT 



600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
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1860 
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2040 
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2280 
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AAGCGAAGCG CcTTACTCAC AATTCGCTGA CACAGATCGA 
GTCAAATACT CCTTCATCCG GTCGTTCAAA CGTATATCCT 
TCACATCCTT AAAACCCGCA GACCGTGCGT TGATAGCCTC 
CCACATCCCG CTTTTTTTCG TAACAGGAGG CTAAGAGATA 
TTTCACCCGA CTGCGACTTG CAGCGCAGCg cAcGGTCAAA 
CATCCTTCGC ATCCATGTAG CAACAGCCAC GCTCGGTAAA 
GACTCCTCGA CGCCTGCTCA AAGGAGCGAA GTGCCGCCGC 
AGAGCTGCCC CTGGCAAAAG AGAGCCTCCG CATTTGCAGG 
CGAGCTCTGC CCTTGCTTCT GCAAGATGAC CAGACCGAAG 
CATGCGCTGC AGGATGCGCA GygTCCATAC CGaTGCACTG 
GATCGGTCAC ATTCTTGCGT TCAAACAGCT GCCCAATACG 
TAGGGT 

(2) INFORMATION FOR SEQ ID NO: 153: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1905 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



AAAAGGAATC GCCATCGAGG 
GATATTGCGC GAGtTTATGG 
CCACTGCTCA ATTGCCTTTT 
ACGTATCTGC AAATCCTCAT 
CTCAACAATG GCCCGATCCA 
ACACTTCTGC CGCAGCACAG 
ATACTCCCTT TCTTTCCGCA 
TTCCAAGGCT AGCACAGTAT 
AAGCAACGAA CCGAaGCGmr 
GcGATAGTAT GCCGCAGCCC 
G TAG TG AT AA TCTGCACACG 



13041 

2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
2946 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 153: 

TACTGAAAAA CCGACCCGAG TAGCTGCCTC CCCCATAATT CCcGTAGACC aTAGGCTGAa 60 

TAAGGATGGC AACTCCaTCT TCCCCaTCCT CCTCAAACGA C a GC AG TTTA CTAACAaGCC 120 

TTACCGCTTC TTCAAGCTGA ACGTAGGCAT CATCGAAAAA GCCGTCTGGA aGCAAGGCTC 180 

GATACCGATT CATCGCCTCG CGCCCAGAGG GAGAAGATTT TCCTCCCTGC AAAAAAACCT 240 

CTATTTCCTT CAGGGTTTCG ACAAACTCCG AAGCACCCTT GGTGTTCTCC TCACTTTCTG 300 

CGATACCAAG CAATACCCCA AATACACCCT TCAAAAGGAA GAACACCTCG TGCGTTGCAA 360 

AATGCTCCCC GACACGCTCG GCAAACCCCG CGAAAGTATC CCGTGTCAAA CCAAAGTTAT 420 

GTAATACCGG ATAATTCGAA ATGGCCAAAT TGGGGGATAA AACCACCTTG AGGAGCATAG 480 

GATTCTTTGC ATCCGCATAC TCTTTTCTGT TTAAAAGCGT AAACTTCCTT AAATACGGAC 540 

TTAGCGCGGA CCGAAGCTTC TCCCCATACA GACTACGAGA GACTGTCGCA TCAATAACCA 600 

CGCTCGGAAG CACGGGCAAC CCCAGACTTG ACAACTCATC CGCCTGACGA CCCCGAATCC 660 
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CCAAAAGCCC TCGGTCAAGC CTCTTATCGA GAGCCTCTTT ATTGCTCAGA AAATGAATGG 720 

ATTTGGCAAT GTTCATTGTG CACTCCTAAA ACAAATTCAA CACCGTGTCT GGGGAACCGT 780 

GCAAAAACAC CTCAAAACGA GAACTCGCCC CCTCCTTGAA AT AC TT AG AG AGCTTGGACA 840 

AATCCTTAAT TTCCTCACCT GACACTGCAA ACTGAATCAC ACTCCCGTGC TTCACCTTTC 900 

CCCATTTGAA GAGAGTGTTA ATATCCACAA TACGCTCACC ATCGTAGAAC ACAATGACCT 960 

CTGAAGAAGG GTACCGCGCG TTGTAACTCC TGATAATACG CTTCCACGCT TCCACATTCC 1020 

CGTTATGAAA CAACTCGTTA GAAACAGGCA CCGATATTAA CTGAGACATC CGAATCGGAC 1080 

CTGcAGAGGC TTGAGCAGGA GGACGCGCGG TAGGAGACAC AGACTCAGAC GAAGCCCGAT 1140 

CACAGGTAGC CCCTTCAGAC GCCCTGGCAG CGTCCGCACC CGACTTCTTT GCCGATCGAA 1200 

CCCTCTTGCC AGAACCCTCA GACTTTTTCG TAGACACAAA TGCAAAAACC CCCGAAAGGA 1260 

TTTCTTTGGG AACCACGAAC TTACGATTCT CTATCAGAGC AATCAAACCC TTGGCGrCtT 1320 

CGTCTGCAAT CCGTTCATCA AGCGGTCCCC TATCCTGCTT TCCCACGTAC ACCAACAAGA 1380 

GCTCATTTTT CCGAAAACCC TCGACAAGGG AGGCGTTCGA AGGATTCTTC GGGTTTATTG 1440 

CCAAAAATCC AAGATCGGGG TGATGATAAC CAAGTACAAT ATCCACTCCC TTCCACCGCG 1500 

CAGTTTCCTG AGCAAACCCG AGACCCTCGA CGGAAACCTT CCTTAGATTG TAAGAAAACA 1560 

GCCGATACCC CCACGTGTCG ATGAGTAGCA CCGACAGGAG CGCTGCAACC TGCTCGCTTT 1620 

GCATCTCcTT CGCACAGAGC AGCAGGTTGA CCTGGTCTGC GACGTTATTC TTCGGATCAC 1680 

ACTCATCCAA CACGCTCAAA ACGCCCTTCA CATGACGCAC TAGAGCATTA AAGTTCTTCT 1740 

TCACCGCTGC AGGCACGTAA AAACTAGATG CACTCACCAC GTAACACTCC TTTGATGATT 1800 

AAAAAGCTCC CCGAAACAGA AAAGTTCCAG GGAGCTTCGG ATCACGACAA AGTCTATCTA 1860 

CAATCTCGCT AGAGTTCGCG AATGGCAGAc CGTCTACCAC GTCGA 1905 
(2) INFORMATION FOR SEQ ID NO: 154: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1370 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 154: 
AGGATCTTTT CCTCAGTCAA CTCCAAACGA TCTCCATCTA CATCCAGATA CAGCGTCGTC 60 
CCCTCGAGTA TCTCCTGAAT TTCTGCAGAA GACAACCGCT CAATGCTGAG CGCTGCACGC 120 
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TTTGTCTTCG ATCCAAGCTC TTTTCCGAGC ACTCTAAAAT TCGCTTTTGC ACGGTACTCT 180 



ACTATCTCGT CTTCTTTCTC ATGAAACACG AGCTCTTTTA CATTAAGCTC ATCAAGCACA 240 

TCTTCTTCCA TTTCAAGAAG TGCTGAGCGC TCCATCGGGT TACGCGTAAT AACCTGCATA 300 

GCTTTAAGTG GCTGGCGTAC TTTGAGGTTA CACTGCGCTC GGATCGCACG CGCCATAGAA 360 

AC AAC TCGCT GCACTGTTTC CATTTTAAAC TCGAGTGCAT CGTCTCGCAC CATTGGTGTA 420 

CAAACAGGAT AGTCTGCAAG ATGCACAGAc TGCACATCAT CCGCGGCGCG ATTATTCTGC 480 

CAT AT AC TCT CGGTGATGAA TGGCACCACG GGAGCAATAG CGAGCACACA TCTTTTCAGC 540 

ACGCAATACA ACGTGTTGTA CGCACATCGT TTATCTTCAT CGTTGATGCT TTTCCAAAAT 600 

CTCCTTCGAG ATCGGCGGAT GTACCAGTTG TTCAGCTGAT CTACATACGA AACGATAGGA 660 

TCCGCAACTT TCGATACATC GTAAGCATCA AGTGCACAGG CAATGTCTTG CACCAATTTT 720 

TCTGTCAGCG ACAAGATCCA ACGATCTAAC GGGTTATTCA AATGCGTCGC TAAACGCGTG 780 

ACCGCCTGaC CCATTcCGTc AACTTTTGCA CATACAGGcA GGATCGATAC CATCGATGTT 840 

CGCATACGTA ACGTAAAAAC TGTAACTATT CCACAATGGG ATAATCACAG TCTTCAAAAT 900 

ATCTTTCACC CCTTCGTCAG AATATTT TAA ATCATCCGCA CGGACAACCG CAGAACGAAC 960 

AAGAAAGAGG CGGACGCGTC ACACCGTAGC GATCCATGAC TTCATTTGGA TCCGCATAAT 1020 

TGCGCAGGcC TTGGACATCT TCTTTCCATC AGACGCAAGT ACCAACCCCG TAACGATACA 1080 

GTTTTCAAAC GCAGGACGCT CAAAGAGTGC CACAGCCAAG ATGGTAAGGG TGTAAAACCA 1140 

CCCTCGCGTT TGATCTAACC CTTCAGAGAT AAAATGAGCA GGGAAATACC GcTCAAAGTC 1200 

AGTTGcATGT TCAAACGGAT AGTGcTGcTG CGC ATAAGG C ATTGcACCAG ATTCAAACCA 1260 

ACAAT cTAGC ACCTCAGGkA a cGCGTCGCA TcACACTCCC AcAGG sGc AA GGaATTGTTA 1320 

CCATATCTAC AACGTGcTTA TGCAAATCTT CAAGCAACAT GCCGGrAgTT 1370 
(2) INFORMATION FOR SEQ ID NO: 155: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1073 base pairs 

( B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 155: 
ATGAGCGATC TCTTTTTTAG AAAGAACCTG TCAAAAGGAC AAACGGAGCA TAATGCACAT 60 
CGTAAGAGGA AAATAAAATT TCTTGTAGAA TCGGATTGAC AACTCTCTAT GAGGCTTTGA 120 
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CGCATATTTT AACGTATGAA GCGTCTCTTT TATTTTCTGG ATGGACATmC CTTTTGTTAA 
AAAATCCTGA TGATCAAATA ATTCAAGCGG TTCTCTCAAC TCGAAAGAAT CAAGATATTC 
AGAAACCATA AGTTTCATTT CATGAGTGAT AGTATATACA ACCGGTTTTA TArGAACCCA 
CCTGTCTTTC TGCCAATGTG CAGTGTGTGC TTCGATAaGC TTCTGAATAG TACCGTTACT 
ATTTTTCATA ATAACTAACA TATCAACGAG CACCTTTTGT CCTCTGTGAT ACGCACCCGC 
AATGGAGATT ATCTT AC C AT CAAAAGAAGA AGCAACAACG TCATTTTCAC TATTCCC CTG 
AGCATAGCTC ACAGTACGTG CAATGATTTC ATCCTTTTTT GACTGAGAAA CAACTACCAC 
ACTATTGTCA AATGCAAATA TTCCAATTGC TAAGAAACAC ATAAAACCAA ACCACGGGCC 
TACGATACTA AAAAAAGATA ACCCACCGAA ATGGATGGCA AGAAATTCAT TTTGGACATA 
CAAGATTCGC AC AG T AT ACG TTGCAGCAAA AAGCACAGAC AAAGGACAGC ACAGCAGTAC 
ATAATGTGGG ATAGATACAT ACCATATCTT CACCAAAGAA GAAAAACGAT ACCCATTCGA 
AATAnAmTGC GTAAGATTAA CCAGAATATC AATAGCATCG AGAATCATAC TCGCACACAC 
GAGCAcTAGA AAAAAAATTG GTAAGAACAT GTCCAACATA TACACCTGCA GTATTTTCAT 
ACTGCAGATT TTCTCATTGA CAAAATCAAT GCAGTAAAAA ACAATGCGGC GTTTGGAATC 
CACAGTGCAA GGGAAGGAAC AATACTAAGC CTCAACGCAA GTTCTTGTCC TCCGATAGAG 
ATAATCCAAT ACCCTAGTAC AACAATAAGC CCTCCTACAA ATCCTTTCCC CTG 
(2) INFORMATION FOR SEQ ID NO: 156: 

(i) SEQUENCE CHARACTERISTICS t 

(A) LENGTH: 884 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 156: 
TTTAAAAnTC AGGGAGCCCG AGAATTGTCA AAAATATCCT CAAAAATATn GAGCGTTTGA 
TCAAGAAAAA TTCCAGGCAT TTCTGCAACA AATCCTCAGA aACGTaAAAA CGCAGGAAGA 
CCCCCGcGTG CTCGACGCCT ATCGGCGCCT TTTTCGCAAG AGCGTCCCCT TCTCCATGCG 
CTCATACGTG GCCGCACACC TGGCGCACAC GCACTGCCGC GCCGGCGCCA CCGCCGcAGC 
ACGCGGCGCA CCGGAGCGCG GGAAGGTATA CGTACCGGCA GCGCGCACGC GCTTTCCTGC 
GCCACGCGCC TGCCAGCGCC GCCCGCGCGC GCGACTTTGA GCGCAAAGcG cGCgGACTAC 
CCCGCCCTCT GCCCGGGGGA CACCACGAGC ATCTTCATCA GTATTGGAAA AAATCGGCAC 
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ATCTATCCGC GCGACATCAT CGCCCTGCTC ATGCAGCGCG CCGATGTTGC ACGCGAGCAC 
ATCGGCACCA TCCGCATCCT CGACCACTAT TCCTTTATCC AAGTCCTTTC GGGTGAGGCA 
GAAGCGGTTA TCGCCCGTTT AAATGGCCTC TTTTACCGGG GGCGCACCCT GAcGGTGAGT 
CACTCACGCA GGGCGGACGA GCATCCCGCT CCTTCTACAG AGCCGCACGC CGCTGCCGTT 
GCACCAGAGC CTGgACTTTA TGGgCAGAGC CcATCCCCGg CCCTGGgAAG AATAAGGgAA 
CATGgCCGTt CAGTGcAGGG gCGCCGgCCc GGGGCGCGGT GGGCAnTTCG GTGCGGTGCG 
GCAGTTTTCG GTTGACAGGG CTGCGCGTGG GAATCCTACC TGnAGCGGCG CTCAATGGAG 
GCGCACGTnC AGGGCCGTTT GCCATTATGG GCAGCCCGCT TTCT 
(2) INFORMATION FOR SEQ ID NO: 157: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3247 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 157: 
GGGGAGGTTT ACATGCGTAA TACCGGATCC GCCACCTACG CCCGCATCGG CGGTGTGTTC 
CTCTCTTTCT TTGTGCTGTT ACCCGTCTTC TGCCATGGCA GCAAAGAGAA GGGAAAGGAA 
GAAGAACCGG TTCGCCTCTC AGTCCTCATA CGAGAGAAGC ATTACTCTTC GGGCCTGCAG 
AATGTGTTTA CGAAGTTGGA ATTGGAAGAA GGAATCGCCG TCACCGTCGA AACCATCCAG 
GACGATCAGT ATCCTACGGT GCTTCACGCG CGCCTTGCAG ACGGGACCGC TCCGGATGTT 
GTAGAGGTGT CTCTTCCCTC GCTCCATGCC CTTGACCCAT ACCTTTACTT TGTAGATCTG 
AGCAAAGAAG CCTGGATACC GGATCTACTG ATTCCTCCCA CAGATCCGTA CGGCAAGACA 
TTTGCGCTTC CCTTAAACTG CGCCGTGTCT ATCAATG C AC TTTTCTACAA CAAGGACCTT 
TTTGATCGCT ACGGGATATC CGAGCCCAAA AGCTGGAATG AACTCCTAGA AAGCTGCGCT 
CTCATTGTAA AAAGTGGCAT TTCTATCGTA CCCCTCGCGC TCAGCACAAC GGAAAGCTTT 
CCACATACGT TGCTTGCTGA CGCGATTACG AAAGTGCTCG GTGAGCAGGG CGCTCGAGAT 
TTAGTCAAAC GTGCCACAGA CGACTCCATC GATTGGACGC ACGAGCGTGg CtCTATCCTG 
TACTCGGAGC CTATCTGGAA CTGTTCAAGC GGGGATACGT AAACAAACAC CACCGGACTG 
CGCGCGTGCg GAAaTCATTC ATGATTTTAC ACGCGATCGC ATCGCTATGT ACTTTGGCAG 
TCACCTGGTT GCAGATGCAA TCATAAAAGA ACGTCCTGGA ATCAACTTGG GCGCGTGCGT 
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CCTCCCTATA ACCGAAAATG CACAAGACGT ACTGACTGGA AGTTTGGAAG TGCAAGGACT 960 

CGCAGTGCAC AAAAAAAGCG CGCGTGTGGC AACCGCGTGT CGTGCACTCT CTGTGCTTGC 1020 

GTCTGCCGCG TACCAAAACA GTTTCTTTGA AGAACACAAA GGGCTTCCTG CGTTTCGAAA 1080 

CACCACCAGC GCAGTTATTC CTGCGTGCCT CAGTGcCCTG TTTAAAAGCC ATATAGAGAA 1140 

AGGAAAAGTA ATACAGGCAA TCGACGCGTA CGrCAGGCGC AAAACACACC CCACAGAGCC 1200 

TCTGTTTTTC CAGATTTCGC CGCGTATGTA ACCGACCCGG CACCAACTGC GCACACCATG 1260 

CTGCACCGCG CCCAAACTGA AGCGCGGAGG AGAAGAGAGC CGGTACAAAA AAAAGAATGA 1320 

GAGCTCCTGC GCGGGCAATA CGTGCCACGA GCCCGGCAGC TTAACCTACC GCGCGACAAG 1380 

ACAATCTTGC CACAGCCGGT CGAGGTCATA AAaGGC gCGC ksCgCGyTcA TAAGATGAAC 1440 

CAGGATAGAC CCAAAGTCCA GCACCCGCCA TTGCTCTTCG CAAAGGCCTC GTTTTTTTCG 1500 

ATGTACTTCT CTTAGGCCAA AGCGAGCCGC CTGCTCGCAC ACCAGACGAT GAGTGCCGTG 1560 

CAAGAGGCCA GGCACAGTGG CAACTACCGC AAAGTCCGCC CAGCCGCAGC GCGCGCTTAC 1620 

ATCAAACACA CATACATCCT CCGCGCGCGC ATCACACAGA GCCTCTGCTA CCGCGGAAGC 1680 

AGCTCCGTTA GCACTCACGT TCCCTCCTTT ACCAGATATC CATTAAAATC CTTCCCCAGG 1740 

ATAAGCGTAA AATCGATGCC CGTTTCCACC CCATACTCAT CAAGCGAGGC GCTAGTTGTC 1800 

TCAATATTCT GACAACGAAT CACcTgCGCC ACCACTTTAG CCACTGCAGG ATTCCCAATG 1860 

CGATCGACGA GCACCGTCTT TTGCACACTT TGCTCCAATG C ATT AT CAAC GCGAACTACG 1920 

TCGTAACCAA ATCCTTGGTA AATATTCGCC GTCGTGCGCG CAAGACCGTG CGATTCAGTT 1980 

CCGTTAAGAA TTTCCAACGC ATACACACGC TCAAAGGCCG TACCATTCTC TGACGCAAGC 2040 

ACCGCAAGCG TCTGGCGC AC GATTTCCTTA ATTTGCTGTC CATCaCGAGA TGGAAAAAGG 2100 

AGTACCTTGC CGTCTACCAC TCGTTTAGTC CCTGAAAAAC GCTGCGGCAC TAGGCGTTCT 2160 

GAATCCAACT TAGATAATTC ACCTATAAGC TTTTTTAAGT CAGCACGCCG AACGTTAGAA 2220 

CGTATTAGCC TGTTCAAGGA AAAAGCACGA GTTGAATGAA CAAAAAACTC ACTGTGATCA 2280 

TTAACACTAC GCAACAAAGC TAAAATAGCT TTCTGTTTCC TCGATGCTGA CTCCCCTTCT 2340 

CCCTCATCCT CGTCTTCGTA TAAAAGATAA TCACGCATCT TATCCCCATC CAAAGACACT 2400 

GATCCTGACG GTAACAGGAC ATGCCCTGCA CGTTCTGTGT GCACGTCGAT GGGCGTAGGG 2460 

ATAAACACCG ACAGACCAGA GAGTAAATCT GTCAATTTAG AAAAGTTATC CAGCGAGCAC 2520 

ACTACAGAAA AAGGAACGTT AATTCCCGTT AACTTTTCTA CCTCCCTTTT ATACTCCTCA 2580 

ATACCACGCT CGCTGTAAAG CGAACCGATG CCATCCGTAC GGCCAAGACT CTGCAGAATA 2640 
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AGTCCCATAT TATGGGGAAT ATCAAACATT GCCGCGCGCC TCGTTGCAGG ATAATACGCA 2700 

ACAACATTGC TGGAGATTGG AACGTTCTCG TGTTCAATGA CAAACAACAC CTTGAGAATA 2760 

TTGTCGCTAG AAAGTGAGGA TTCAAGCGGA TCGCGCTTCA TACCAAAGAA GACTGCAAAG 2820 

ACGGTGATAA CCAG CAT AAA AAAAATGAGG AGTAAAAAAA GCCCATGTCT TCCCATATCC 2880 

AACACTTTCA TTGCACACTC CCTGGCCTCT CGTCGAACTG CGGCAGATAC ACACACTGCC 2940 

CTACTTTTGC CCGACTGCCA CCCGTTACGC GGGTTGACTT ACATCCTTTC CCAGTGCACG 3000 

CAACATCGCA CACGTACGCG GATGGGGCGT GGTACGTTTT CGCTCTGCGC ACACAATACT 3060 

CGCACTCACT ACTTTTGCAG CAAGCGCATC TAACGTTAAC GTTGACACTG CACCGCGCAA 3120 

gGACGCCGCC CAAGCACGAG CTGGcTCAAt TTTGTCTGAT ATAAAAAGAA TCTTTCCTAA 3180 

CACACCAAAG TCTTCACACC CAAAGGnATG CCAGCGCACC GCAGACAACA ACACCTCATC 3240 

CTGnACG 3247 



(2) INFORMATION FOR SEQ ID NO: 158: 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 1691 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 158: 

CGAGTGCTAA AAGCTCATGA CACAGATCCG AGCGATCCTT TTTTaTGGCA AGCACCAATG 60 

CCGAAaTCCC ATCCGATCCC GTTTGTCGAT TGGATATCCC TTTGCCAGTA AAAGCCGAAA 120 

TGTCTGCACG GGCACATTTT CCATAACCGC AAGATGCAGT GGCGTCTGCC CGTCAGAATT 180 

TGCCAGGAGC GGATCCTTCC CCACGACTGC ACCGACAAAC GTATCTCCtT CAACAGGGCA 240 

AGTGACAGCG GCGTAGTCCC GTACGAATCG CGCGCAAAGG GGTTGCCTCC TGCAGCGCGC 300 

AACGCCGCAA TAACTGAc TG CGAATTGCAC AGTACCGCCT CGTGCAAGGG AGTGCGTCCA 360 

TACATATCCT GCTGCACAGG ATTTGCACCC GCAGAAAGGA GCATGTGCAC CGACTCTTCC 420 

TGATCTGCCA AAACCGCATC CGTGAGCGCA GACTTACCCG TCTCATCCCC CATGTGCAGC 480 

GCCACCCGAT GACTCAACAG CAACCGGATA AAGTCGACAT TCCCCGCACG AGCTGCAAGA 540 

TGCAACGGGG GTTTCCCAGA AAGATTCCGA GCATTTAAAA GAGATACATG CCGCGtCATC 600 

tTCGCGGATA AGCACGTCAG CCGAACGGAG CGCAGACCAA CGCACACACG CATGCAACAC 660 

CGTATTTCCA AtGCGTCCCG TGCGTCCACA AGCGCAGGAT TACCTGCCTG AGGATGCAGC 720 
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AAAATTGAAA TAACCTCCGC TGCATCTGAC TTTACCGCAC TGAAAAGAGG CGTTTCTTGA 
TTCAAATTGC GTGCCTCTAT CTCTGCCCCC TTGCGCAAAA TACCATTAAT CGCCTGCGTA 
AGTTTCCACT CGCACGCCAG ATGCAGCGGC GTGTTTCCTC CTGTATCCTG AGCGTGCACG 
TTTGCCGCAG TCAGAATCCA ATCCTCGCGA CCGCCACTAG TTGTCAACGC TGTTTTCAGC 
GGTGACACGC CATGCACGTT TGTACTGAAG ATATCAGCTC CTTCCCTCAT CAAAAACTCA 
CCGACAGCAC GGTCATCGTT TGCAACCGCA TAGTGCAGCA GCGTGTTTCC CATCGTGTCA 
CGTGCAATCA TCTGTTTTGG TTCGCGAAAT AAGAATTTAA CTATGTCAAG GTGCGCACGC 
CTTGATACTG CAACGTGCAA AGGATCGCGC CCCACCACAT CTTGTTTATA CAAGTTGCTA 
TCGGTGACGA CAATTTTTAC TGTCTCCAAA CCGCGCGCAA GTGCCTTGGT AAGCGGCGTT 
TCTCCGCGCA TATCTTCTGC ATGGATATCT GCCCCAAGAC TGACAAAGTA CGCAtCAAAT 
CCCGATGATC ACGGTCAATC GTGAGGACAA GCGGCGTTTC GCCCTTTTTG TTCCGTTCAG 
AAATATCTGC GCCTGCGCCT Ac TAGACGCT CTACAAACGC CCTGTCCATA CCTAAACGGG 
CCGCGACGTG CAAAGGAGTC TCGCCATAGT CATCCTTTAT GGCGACAGAA GCGCCTGCAT 
CAAGCAGTGC GCCAACCAAA CGGACACGGA AGGGAGCAGG AGCGACAAGG TGTAAACAGG 
TGTTCCCCGA CGCGTCACGC ACGTTTGGAT CGGCCCCACT GCGCAcnCAA AGACCGCaGC 
ATCCACCTGC CCTGCACGCA CTGCTTCGTG cAAAGGGGTG GcGCTGGaTA GTTTTTTGCA 
TTGAGATTGA C 

(2) INFORMATION FOR SEQ ID NO: 159: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1462 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 159: 
TGCGCATCGy GTGaGAGGaT TGACATTTTT ACTTGCTGAC GAGGCAGGCG TATTCAAGAC 
GGTGTATTCT GATTTTGAAG AGATGGCAAC GCATGAGATG GAACATCAGT CGTTACAGTA 
TTTTTTCAAT CAAGATGTTC TTATGATCAG AAAATCGGAT CCAGTTATTG ACCATGCACG 
TCAGGCAATT CGGACAGATA TCCTGCATTT TTTTGATATA ACGCACGCCC AGATGATTGT 
CTTGATTCGG GAAGCTCAAA AACTTATTGG ATGTCTATGT TTTGGTATGA AAAACAACGG 
CGTAGAATAT AGCAATCATG ATCAGCAtGT TTTAGAAAAG TTGTATtCAC ACTT t GTATT 
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GGTTTCCTAT TACTTACAGA aTATtGCAAA GCAAGACGTG GTTATCACGg TGGACAAAGA 
ACTTAAAATG TCCCATCAGA TTATTGAGTC AATACAACGG aAAAGGGATT TTATTCAGGA 
TGCCTCCGTT GAGGTGGATT CAATTGCGTA TTCTGCGCAC CAACTTGGGG GCGATTTTGT 
TGATTTCATT AAACTGTCCG AGAAAAgATA CCTGCTAGTT ATCGGGGATG TATCGGGGAA 
AGGTCTGGCA GCGAGTATGT CAATGGTGAT TTTGAAGTCT GTACTGAGTA CCTTTCTGCG 
GGGACTGTGC CTGGAAGAAA CGGCAGTATT TACAACCTTT ATTGAGAAGA TAAACCGGTT 
TATCAAAGAC AATTTGCCGT GTGGGACTTT TTTTGCGGGT GTATTCTGTA TTCTtGACCt 
GGCAACCCAT ACGCTCTACT ATGCGAACTG TGGCATACCG CTCATG TCG A TGTACGTCGC 
TTCATACAAG AACGTGGTGG AGATACAAGG CGAGGGGCGC GTGCTGGGTT TTGTTAAAGA 
TGTTATGCCC TTTTTGCGGG TGAGGAAAGT TCAACTCGGT CAGGGGGACG TGGTGGTATT 
TTCCACTGAT GGAATGGTAG AAGTACAAAA TTTGCAGCGG GAGCGCTTTG GTAACGAGCG 
TGTGGATAGG ATTCTACAGG AAAGTCATGG TCTTCCGGTT TCTCAAATTA CCCGTACTAT 
TTATGCTCGG CTGTGTGAGT TTATGGCGCG AGATATGCAG GATGATGTAA CTGTTCTGGC 
AATAAAGTGC CTTGGGCCTC GGTAGGAAAT GAGGAAAAGT TCGGGAGCGC CGTATGGATA 
ATATAAATAT CGCCAAAGAC GTTCGGCCTG GGTGCGTTTT ATTAACGGTG ACTGGAGCGG 
TCAGCTCCTA TACTTACGGG GAGTTTGAGT CGCGTGTGCA TGGGGCGCTC AAAGAGAATC 
ACGTTGTTTT GGATCTCTCC GGCGTGACGG CTATGTCTTC TTCGGGATTG GGGGTGCTTA 
TCTC TGCAT A CGATGAGGGA CTGAAGTACC AGCGTCGTCT GTGCATTCTT AATCCTTCTG 
AGAGCGTAnC AGAGCGATAG AG 
(2) INFORMATION FOR SEQ ID NO: . 160: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1013 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 160: 
TGGAACGCGC GGTTATTACG CGTCCTGCAT ACGCGGTGGA TTATGCGGTG CTATTCCCTG 
TACAACTTGG TATTGATTTG CAAACAAAAA GGGTGAGCGG GCTCTTTTCT GCAGGTCAGA 
TTAACGGAAC ATCCGGCTAT GAAGAAGCTG GAGGTCAGGG TATTATCGCC GGGATTAACG 
CTGCGCTGTA CGCGCGCAGT ACTAAAACCA AAGAGGAGTA TCATCCATTT GTTCTGAAAC 
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GCGACGAAGC ATATATTGGC GTCATGATAG ATGATCTTGT AACACAAGGA ATAGACGAAC 300 

CCTATCGGAT GTTTACCGCG CGTGCGaGTA TCGTTTGAAA CTCCGTCACG ATACTGCGGA 3 60 

TGAACGTCTT ACAGAAAAAG CTTACGCCAT TGGGCTGCAG AAGAAATCTG CTGTAGAAAC 420 

GTTGCAAAAA AAGATGCGTA CGAAGCACGA GATCTTGCAT CTGCTTCAGA CCAACAAAGT 480 

TAGTCTTACC CATGCAAACG CATATGTTCA GCTGAAGCCG CATATAGGTA AATCGTTTGC 540 

AGcTACGCTA CGTGATC CGG TAATACCTCT TGGGCTTAwC kCTTCGCTGA ACGAGCAGAT 600 

AGCGCAGTTC CCTTTGGAAG TGTTCCAGTC GGTTGGGGTG GAGATACGCT ACGAACACTA 660 

CATCGCTGCA CAGGATCAAA GAATTGCACA AG TGG AG AAA ATGGAAGGAA TAAAGATACC 720 

AGCGCATTTT GATTACGCGC GTATATCAGG TCTCTCTGTA GAATCCCGTA CACGATTGGA 7 80 

ACACGTTCGC CCGGACACTA TCGGGCAGGT TGGGAGAATG CGCGGAATCA GACCCTC TG A 840 

CGTAATGCTG TTGCTCGCCC ACTTAAAGCG GTAGCAGCTA CCGCAGAGAT AGAAGAACCG 900 

CCTTATCAGG CAGGTGTTTG TACGTACTTT TAACGCACAG CAAGGAGCGC TTCGGCGTGA 960 

AGTTCGGTGA TAAGGCCACA GGAGACCATA TCAAACAAnT GTnCGCTATT TGT 1013 



(2) INFORMATION FOR SEQ ID NO: 161: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1129 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 161: 

GTAGTCACGG CGTGTGCCCC GCGTCAACCA ATGCGGTAAT ATCTGCCAGC CGAATTTTTT 60 

CATGTGCAAG GCAGTCCCCT ACGCGGTTTT GCGAATCGAC GAGTACTGAC AGATATACAT 120 

CTTTCAGTTG TGCAGGCGCT TTATAACGCA AGAACTGAGA AGCTGCTGTC CGATCAGCAG 180 

GCAGGCGCAT ACGTGCACGT GCAAGATCTA GCGTAAgcTG TGCGTTAATG GTACCGTTTT 240 

TCCAATCCTG CACCGCATCG GTGTGTACGG CATCCTGGGT CTGCGCACCA GACGGCGCGT 300 

GAGAAAACAG AACAAAAAGG AGGAGGAGAA TGAAACGCCT CGCACTGCTG CGTACCACAT 360 

GGGGCGCACT TGCGTGCGTC CACACCAGGG AGAGAGCGCC GAACGGAAAA CGGCACTGAT 420 

TTGAAAAAGA GGCAGCACGC GTCAGGCAGG ACACTCCCAT ACCCTACCGT CTCGGCGTTT 480 

TGAGAATCAT TCCCAAGGAG AGCGTGGCGT CTACTGCTAG ATTGTTTTCC TCTGCCAGGT 540 

TTTCTACTGA AACGCCGTAG CGCTTTGCTA AGGACCAGAG GGTATCsCCC TGCTGCACTA 600 
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CATGCTTGCC GACAAAGGGT GCTGCAGGGG TTGCCCGCGC CGCTGTGCTT TCCGGCGGTG 
GTGGCACGCT CGGGGGTGCC GGCGCGTCTG CTCCCGCGCG CGTGTCCGCg CCCCCGGGCG 
TCGGCGGCGT CATTGAGGAC GAGGGAAnTG CGGACAtTCC GGCAtGCATG CTCTTtCTGA 
TCCATTACTC CAGGTGCACG CGTGCCTGCC CCCCGCAGGc GCGCGCCGGG TCTGCCCArT 
CCGGATGGTG GGAATGATGA GCTTTTGTCC AATCTTCAGG TGAGTGGCGC TGTGTGCGCG 
GTTATGGGsT TTTAGCGTAT CGACGCCCAA CCCATAGCGG CGGGCGAGTG CGTAAAGCGT 
GTCTCCTGAA CGGATGGTGT GCAGGGTGTG ATGGACGAGC ACGGCGCCTG GGCGGTTGAG 
TACTGCCTGG ACCGCCTGCG CATGTGTGCT TGGCACCCGG AGGGTGTACG CTGCATTCGG 
GGGAGTAATT GAATAGCGGA GTGCAGGGTT GAGCGTGTGC AACAATTGC 
<2) INFORMATION FOR SEQ ID NO: 162: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1713 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
<D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 162: 
GTGAGCTGCG TGCAG TAG AG ATTGGAAAGC TGAAACAGGC GATTTATTCC CTCGAAAACG 
ATTTAAGGAA TGAGAAAGTT TCGGCTGAAA GGCGGCTCCG CGTCCTCGTC TCATCAATCA 
TTTTTTCTGG CTCGTTGATC ACTATGTGAA TGTGCAAGAA GACAAGAGGA ACATCGACGA 
AGTGTTACTC AAAATTAAAT TGCTTGATGA AGCGGTTTAT CATCGCTATG TGAGGATAGC 
GTGAAAGTGA GGGTGAAGTA ATGTCAGAAC ACATAGAACA CGACGTTCGG GAAATGCTCA 
ATGAAGAAAA ATGGACACGC GCGACCTTAC CGCGTATTCT GCGGAAAAGT TTAAGGAACT 
TGACAGAATC ATTGCGGAGG C GAAAAG AC A ATCTATCCTT GATGTACTGA AAGGTATCTG 
TGACGAACAT CTGGCGCACT CGAAGAACAG TATAATCGCG TTATACATTT CTGGGATTAT 
TTCGCTTTCT AaGCAGTTGT TAGATGATTC GTGTTTAGTG ACGCTGCTGA CTATCTTTGG 
TGATAATCAC AAGAATCAAA TAGTTGAGCA CCTCTGTACC CGTGTGCTTG AGTACGGTGA 
ATCAAAGCTT GCGTTGCGTG CGTTAGGAGA ATGTTACAAA ACCTCTGGAA ACGAACAGCT 
CTATGATGTT TGGGAACGGT TAGTTAGGAT CGATTACGAA GAGGCGGAAA TCACTCGTGT 
GCTGGCGGAT AAaTACGAGC mGGAAGGGAa TAAaGAGAmm sCTACGGAGT TTTACAAAAA 
AGCGCTGTAT CGTTTTATCG CGCGGAGGCA GAACGCGGCC ATAAAGGAGG TTTGGACTAA 
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GCTTGTTGCA CTGATTCCAG ACGATGTCGA GTTTTTTTAT CGTGAGCAGA AGAAAATTTC 
AGAGAAACTG GGAGAAGGGC GCGGGAGCGT GCTCATGCAA GATGTATATG TCTATTACAA 
AGAAAATGAG GATTGGACAA CGTGCATCAA TATACTCAAG CATATTCTTG AACATGATGA 
GAAGGATGTT TGGGCGCGTA AGGAAATCAT AGAGAATTTT CGGTGTAAGT ATCGCGGACA 
TAGCCAGCTT GAGGAGTACC TAAAGATATC GAACATTAGC CAATCTTGGC GCAATGTCTT 
TGAAGCCATT AATGATTTTG AAAAGCATAT TTCCTTTGAC GAGGGTAGTT TTGTTTTTCA 
TCGAACGTGG GGGGTAGGTC GGATTGCGAA GGTGTGTAAC GATGAGTTAC TGATCGATTT 
TGCGAAAAGG CGTGCGCATA CCATGCTTTT GAAGATGGCT ATTAGCGCGT TGCAAACCCT 
TGGCAAAGAG CATATCTGGG TGCTTAAGTC GGTACTGAAG CGGCAGGATC TTGCTGCGAA 
AATAAGGcAG GATCCTGAAT GGGCACTGAA GGTGATCATC ACAAGTTTCG ACAATAACTG 
TAACC TC AAA AAGG TTAAGC AGGAATTAGT TCCTTCTTTG CTTtCTGTGG GGGAGTGGAC 
GAGTTGGAGT AC GAAAGC AC GGAAGATTTT GAAAGAAAGT ACTGGATTTG CTGCGAATCC 
CAGCAATATC GATTTTTATA CGGTGCGGAG CTGTCCTGTT TCCCTAGAAG AAAAACTtGC 
TGTGGAATTT AAGGCACAAA AAAATTTCTT CGCGCGCATC GACATCCTCA TAC CTTTATG 
GACAAGGGCA GATACAGATT TCTGGAACGG GCC 
(2) INFORMATION FOR SEQ ID NO: 163: 

{i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 717 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 163: 
CCGTTATGGT ACAGGAGCGG AAGACTTCGA GTAATAAAAC CAAGCGACTT GAGTGCTTCG 
TTCAAAAACC CCTGGTTCCC CAGG AT AG TG ATCCAGGCGT TTATCCTGAT GAGAGAATTG 
GTCCAAAAGG GGACGATAAT TAGGAACAGG AAGAACGTTT GCCTCCGGCT GCGAGAGAGC 
GCGTACCTGC AGGAAGTGCC AGGAGCACGC ACAAACAGGT GACGCCTGTA CTGATAAGCA 
GGGTGCGAGC CAGCAGTGCG CCATAGCCGG AGGTAAGGAC TTGCGCATAC GCCCGGATGG 
AAAACTTCCA CACAACGCCT CCGTACAGGC CCTTTTGCAA AAAGCTGTAC ACAGCGACCA 
CCGTAAGGGG GCACAGAAAG AAGACTGTAA GCCACGCAGA CAAGGGGAGC GTGCACCAAA 
AACCTAAACT GCCTACGGTA CGcGTCGCGC CACGCGCAGC CGGsTGCTcA CGaCTCGATG 
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TCCTCAACCA CGTAGCCGTC GTGTGCAGAC CAGGACACGT AAACAGTGTC CTTCCACGCA 540 

ATCTCAGGGC CAGTGTCGAG ATACTTCTGG TGTTGCTGAA ACACTTGGAT AATAGCGCCA 600 

CTTTCTAACT GGACGAAAAA CTTAGATTGG AACCCTGCAT ATACAGGCTC CTCTACAAAA 660 

CCGCGAAAAA CATTGAGCGG CGCACTGGTG GTACCCGGGT CTTCAAGGGA AATGTGG 717 
(2) INFORMATION FOR SEQ ID NO: 164: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1283 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
<D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 164: 

GCTATTGTCT GTCCCAACTC AGTACGTGTC TTTGCAAATG CTTTCCCCAG GGCGTACCTC 60 

AACTGTTTTT TCGTATCGTT GAGAACAACA TCAAGCATTA CCTTCTCGAT ATGC TG AT AC 120 

AAAGCACTGA TGGTATATTG CATCCCTGCT GCAGCAACGC GTGCACGCAG TGTGTCGCTT 180 

TTTTTCTCAT CCAGGAGGAC ATCACGTAGC CAATcATTTC CTTCCCCAAG ATTATATTTA 240 

GCAACAAGAT CAACAGCCTT ATGTAACACC GCATCAACGG GATCAGACTG TGCCCGGTAC 300 

AAAACATACG GGAAATACGA CGCACACAAC ACCCGTTCTG CAACTGCTAa TGCCTCTAGT 360 

CTTGCTCTGT Aa TGTGTATC CCTAAAACCT TCAATGACAA TATTACGCGC TTCATCTGTA 420 

TCGAGCAAAC CTGCTCCTTT AATCGAGGCT ATTCTCAGAA TAGGATCTGA CTCCTCAGAT 480 

AACACGGATA ATATTTCCAC CGCATCGGAA CGTCCAATAC CTGCCAGTCC TACAGCAGCA 54.0 

GAAGCTCGAA CGACACTATT TTCCTCCGAG CTCATTACCA CCAGTTTAAA AAAATCAAAA 600 

GcATGCTCAG CATGCAACTG CTCAAGAGCT GCCATAACGT TTTGTTTCCT GATCAACGTC 660 

TTTTTATCGT CATCGAAATG AATATTCTCA TAATACCTCA CTAAAAATTC GGAATCTTCC 720 

GTACTCCCCA TGTTTCCAAG TGCGACAATG CATTCATCAG CATACTGAGA CACTTCACTG 780 

CTCAACACTT CCCTTAATAA AGGAGTCAAC TCTCTCGCCT CAAGACCTGA GATGTATCGT 840 

ATCGCTGACT TAAC TACCAT CGGATTATGT TCCGTTACCT GTTGCAATAC ATCCACCGCA 900 

ACAGACTGCG CACAATCATT TTTCTGAAAC AAGAAAAAAT CAAACAACAA CGCCTTTAAT 960 

TCAGAACTTT TTGTGCGTCC GCACAACATA CACAACGTTT CGTTTAATGA GGCGTTATTT 1020 

TCCTTCTTTA ACTCTTCTAT CAAGGAGATA ATATCAGAAA CTAATCCATA CTTGATGGTA 1080 

TTCATCCTTT TCTTTAGAAG AACCGTTTTC TCATCAGTGA TATCTATATG ACGCTCCTGT 1140 
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TCTTGCACGT GCTGCGTTCC CGTTTCCTTA GCAAACATGG GGACACCCCT CCCTAAGAAA 
AGGATTGCnA CATGCACGCT ACAGTAGTGC ACGAAAAGAC TGCnCCCTTT CACCCCATTT 
CCCCTCCGTT CCATTCTATA TCT 
(2) INFORMATION FOR SEQ ID NO: 165: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2529 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 165: 
CAAATCCAAC TGnAAATAGA TACCCCGGcT TAACCCCCAG ACAAGTGAGC GCGTGAAACA 
GTGTGCACAG CGTCTCTACC GAATCGTTCA CCCCGCGCAA CAGTACCGAC TGCGATTGCA 
CAGGGAGGCt GCGCcTACGC AGGCCnCGAG CACCGCGCGC TGCGTAGAAC CGAGCTCTGC 
CGGGTGATTA ATATGCGGAA TTATCCACAC CGGCTTCATC TCCTGCAGAA ACGCAATCAG 
CTCGGGAGTA AAGGCCTGCG GAGCAAAGGT GACTGCGCGA GTGCACAGAC GAATAATCAA 
ATC CGGCGCT ACACTGgCAG TGCsCGGAAA AGCGATGTGA CCTGTGCAAA AGAACCAGTG 
AGTGGATCAC CACCTGAAAC CAGGATTTCC TTCACCGAAG GGGTAGCACG AAGATACGTA 
ATAATCTTCT CGCGCTCTTC GTTGGGGATC CACCCTGCAC GTTGGGCGAT GAAACCGCGG 
CGAAAACAAT AGCGACAGTG TGAAAAGCAA CGTCCTGTTG CCAACATCAA CACACGATTC 
GCATACTGAT GCACCAAAAA GGGTGTCACG CAtACCGGTC CTCACCCAAT GGGTCGGCAC 
ACTCGCAAGC ATGCACCACA CGCTCCTGTG GCGCAAAGCA CACCTGACGT TTCAGCGCCT 
GgCGTCCGCG CCCTGCGcTT GTGCAATTAA ATGCGCATAC GCTGGAGAAA TATGCTCCGT 
CAGCGCATCT GCCGCGCAAG AGGCAGGACT CAACGTCCGC CAATGCTCAT CAGCACGCCC 
TGCACCTCGT CTCTTTCTCT GTTCCCGGGT ACACTCAGCC ATAGACACGG ACC AC GAGTG 
CGCATCCTAC GGTAAAAATG CATGCGTGTA CAAGTCACGC CGTGCACCGC ACCCGTGCTG 
CAGGATCGCA TACCGTAGTG TTGACACACG TACCTATTTG CAGAGACCCT GGTCCCCATG 
GTCAGACTCG AGCGGCTAAA GAAGACATAC GCAGGTGTTC CTATACTTCG AGATATTTCT 
CTAGAGATCC CAGCGCACGG AATGTATGGA ATCATCGGCA AAAGTGGTGC AGGAAAATCA 
ACGCTACTGC GCATCATGAG TCTTTTGGAG AAACCTGACG AAGGAGCCGT TTTTTATCAC 
ACCACGAGGG TAGATTTACT GCGCGGTGCT GCCTTGCGTG CACAGCGCAG GCGCATAGGA 



1200 
1260 
1283 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
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TTGATCTTTC AACAATTTCA TCTGTTTTCT TCCCGCACCG TCTTTGGGAA TGTTGCCTAC 
CCGCTTGAGA TTGCACGGTA TGCACGTAAG GACGCCTACG CGCGCGTGTT GCATTTGCTA 
CACTTGGTTG GTCTTGCAGA CAAAGCACAG GCGCGTATCA GCACGCTGTC AGGTGGGCAG 
AAGCAGCGCG TAcCATTGCG CGCGCCTTGG CTGCAGAACC TGCAATACTC TTCTGCGACG 
AAGCAACAAG CGCTCTCGAC CCTCAAACAA CACAGTCAAT TCTGACGTTG CTGAAAAATG 
TGCAGTGCTC ACTGCGTCTG ACGGTCGTAT TGATTACACA CCAGATGGAG GTGGTACGCG 
ACTTGTGCGA TCGGGCCGCC GTATTGCATG AGGGAGAAAT AGTGGAAGAA GGAAGGGTGA 
CACAACTTTT TGCTGCGCCA CGGCGGCTGA TCACACAGCA GTTGTTGTCG GGCTGTTCTT 
TTGCCTCTTT TGCAAAGTCA GAACCCTTCC ATCGAATGTC TTCGGGTGCG TGTGCCGTGC 
ATGCTATTGA CAAGGCACAC TGGTAATGGC GAACCAGACA CTGTGGCTTT TAGTAGCTCG 
TGCAACCGGA CAGACAAGTC TGATGGTGTG TGCTTCAGCA AGTATTGCGC TAgCAGCGGG 
AACCCCGTTG GGGATATTGC TGTGCGTAAT GTCGCCTGGA CACGTGTGGG CGCATCtGCG 
TGGCATCGTG TGTTAAGTTC GTCAATGAAC GTcTGCGCGC TTTCCCATTT GTGATTTTGC 
TGGTGGTGTT GCTTcCgCTC TCGCGTATGC TCACAGGACG CACAGTGGGA ACGGCGGCGG 
CTATCCTCCC GCTTGCGAtA cTGCGCTCCC TTTCGTGGCA CGGGTGATTG AAAGTGCTCT 
GCTGGAGGTG GAGCCAGGGA TAATCCAAGC GGCGGTGGCA ATGGGTTCAA GCATGCGGCA 
ACTTGTACTA AAAATCATGC TGCCTGAGGC TGCTCCTGCA TGTGTTTCTG GTGTAGCACT 
GATGGTAATT AATCTAATTG GAT AC TC AGC AATGGCAGGG GCGATTGGGG GAGGAGGTTT 
AGGAGACGTA GCGATCCGCT ACGGGTATCA GCGCTTCCAA CCAGAGGTGA TGACAATGGC 
AGTGCTTGCA ATCCTGGCGC AGGTTGCG t A ACGCAATGGA TCGGGCGTAT AATCTGTACC 
CGAATACGAG CGCGtCAGgT AGTACCCCGC CAGAGTTAGG CAGGACGTCT GTCCTTGCAT 
GGGTAATCCT TTGATCTTTT CTACGGGGTC TGACGCTCAG TGGAACGAAn ACTCACGTTA 
AGGGATTTT 

(2) INFORMATION FOR SEQ ID NO: 166: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4060 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 166: 
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TGACGGCTCT TACCCGATTG AGCGAAAAGA 
ACAGCTTATG CTGkCCAGGT TGACGACCTG 
TGGGACGTGG ATGTGCGTAT TCCAGAGGGG 
GCGGGCAAGT C G AC CC T ATT GAAGGCGATC 
GTGCGTGTCT TTGGGCGTCC TTTTTCAAAA 
CGCAnTGCAG TGGATTGGGA TTTCCCTACT 
TACGGTTCGC TCGGTTGGAT TCTTCGTCCG 
GCGATCGAGG AAGTAGGAAT GGGCGCCTTT 
GGTCAGCAGC AGCGCGTGTT TCTCGCGCGG 
ATGGATGAGC CATTTCAAGG TGTGGATGCA 
AAAACGCTGA AAGGGCGTGG GAAAACGTTG 
GCAGAGTATT TTGACCGCGT GCTGCTTTTA 
GTGTCTGCCT TCACCGAAGA ATACGTTCAA 
CTTTTTCCGA GAGGAAATAA GGAGGATGTG 
TGACTATACG CTGCAAAATG TGGTGTTAGG 
GGTCGGCAGT TTTGCGGTGC TGCGTCGACA 
AACCTTCCGG GGATTGTTAT CGCGTTTCTT 
TTGCTGGGTG CTGCCCTCAG TGGTTTAGTA 
ACTACAAAAA TTGATACCGA TGGTGCGCAG 
GGGTTTCTAT TACTCACCCA CGTGCAGAAG 
AAATTCATTC TAGGGCAAGC GGCCACGATT 
ATGGAGGTGG TGATCGGTTT GCTTGTACTG 
TTCGATCGAG ACTTCTCTGC GGTGCAAGGT 
ACGGCACTCA TCGTAGTTGC AGTTGTCGTA 
AGCGCACTGC TGACTGCGCC TGCAGTGGCA 
TTATGCGCGC TTGCTGCTTT ATTTGGGGGT 
GCCCAGGTTC CCAGGCTTTC TACTGGCCCC 
CTTGTCTCTA TTATGCTTGG TCCTCAGCGG 
CGGGTTTCGC TTCTTCAAGA GGAGGGGTAG 
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AGGGAGCAGT ATATGGCAGA GATATCAGCA 60 

ACGCTTGCGT ATCGGCAGAA GCCGGTGCTT 120 

GTTATCGAGG CCATTATCGG TCCTAATGGG 180 

ATGGGTCTTC TGCCTCTCGC TTC CGGAGAG 240 

GAGCGGCGAC kTGTTGCGTA TGTCCCGCAG 300 

ACCGTTTTTG ATGTGGTGCT CATGGGTTCG 360 

GGAAAAAGAG AAAAGGCGCG TGCCCGGGAA 420 

TTAGACCGAC AAATCAGTGA GCTTTCAGGC 480 

GCCTTGGTGC AGGACGCGGA TCTTTACTTC 540 

GCTACTGAAC AAGCAATCGT TACTCTTTTA 600 

CTTGTTGTGC ATCATGATTT GCAGACGGTG 660 

AATGTTCGCG TCATCGCTGA AGGGGCCGTC 720 

AGAGCCTATG GCGGACGGAT TAGTTCCACC 780 

CACGATGCAC GCGCTCATGC GTCTGTTCTC 840 

CACGCTTTTT TTGGGTTTGG GTTCTGGGCT 900 

AAGCCTTTTC GGTGACGCAG TTTCTCATGC 960 

TTAACCGGCA CGAAGTCTAC TGAGATACTT 1020 

GGAACTGTGG TGATGCTAAT GGTGATGCGT 1080 

GGCATTGTGT TGGGTGTTTT TCTTGGGTTT 1140 

TCGCCCCAGG CGGCAAAGGC TGGTCTGAAC 1200 

TTGCAGCGAG ATGTCCTGCT CATCATTGCG 1260 

CTGTTTTGGA AAGAACTGAA GCTTTCTACC 1320 

TTTTCTCCAC AGCTTATGGA GTTCATGCTC 1380 

GGGGTTCAGG CAGTGGGGGT TATCTTGATG 1440 

GCGCGGCAGT GGACAAACAG TTTAAGGGTT 1500 

GTCTCAGGTG TTTCAGGTTC GGTTGTCTCT 1560 

GTGATAGTGT TGGTGCTGAC GGGTATTGCG 1620 

GGTGTTTTGT ATCAACTGTG GCGGAGAAGA 1680 

AATATGACCA TGGAGGTTGT GCTTATTGCA 1740 
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GTGGTCGTGT CGGTTGCGTG CGCGCTGTGT GGGGTTTTCT TAGTGTTGCG TAGAATATCG 
CTGATGAGTG ACGCGATCAG TCATTCGGTT ATCCTGGGGA TAGTACTCGG TTATTTTCTG 
AGTCGTACGC TTTCTTCTTT CGTGCCTTTT GTGGGGGCAG TGATTGCGGG GATATGTTCG 
GTAATCTGTG CAGAACTTTT GCAGAAGACA GGGATGGTAA AGAGCGATGC AGCaGTCgGG 
CTTGTGTTCC CTGCAATGTT TGGGTTGGGG GTGATCCTTG TGTCGTTGTA tGCAGGGAAT 
GTACATCTTG ATAC AGATGC GGTACTGCTT GGGGAAATTG GACTnGCGCC CTTGGATAGG 
nTTTCGTTTT CAGCTTGGTC CTTGCnTAGG AGTnTGGTAn AGATGGGGTC CGTCnTGTGT 
GGATTACTGC TGTTGCTTGC GCTCTTTTTC AAGGAACTCA AGATTTC t AC GTTTGATCCG 
GTGCTTGCCA CGAGTTTgGG TTTTTCTCCT ACGCTTATTA ATTATGGGCT TATGCTCGsG 
G TG AG T ATT A CCTGTGTGGG AGCCTTCGAT TCGGTGGGTG CAGTGTTGGT CATTGCATTG 
ATGATTACAC CGCCTGCAGC AGCGCTTTTG TTGAcAGAtA mCTtgTwGTt GATGTTGGTC 
CTTGCTTCAT TGCTCGCCTC TTGTGCGTCC ATTAGTGGGC TTTTTCTTGC GGTGAAGATA 
GACGGCAGCA TTGCAGGAGC AATGGCTACC ATGGCGGGCG TTCTGTTCGC GTTGGTGTAC 
CTTTTCTCTC CAAAACACGG GGTTGTGCGC AGGTGTCTGG TAATGCGTGC TTTGAAACTT 
GATCTAGATG TGGTGACACT TGCCGTGCAT CTTGCAAcaC AC t TACACGG TGGAGCGCAG 
CGTGGAGTGC GCTGAAGTGC ACCTGACAGA ACATGTGAGT TGGTCTGcGC GCAGGGCGGC 
CCGCGTGGTG CGTACCGCGC TCAGGCGAGG G ATGG TAG AG CGTCACGGTG CCTTGCTGCT 
ACTCACTGCG CAGGGTGTGT nCGCTCGCGC AGGCGCGATT GGATGTATCC GTGTAGGCTG 
AGTCGATGTC GTTAGTGTCA GATATTGCAG CAGAGAATTA TTTGAAGACA GTGGTAAAGG 
CGTTGGCGCG GTCTCGTCGG GAGCGCGTGG GTACCGGGGA GTTGTCTCGC CTTTTACACG 
TGACGCCGGG GACTATCAGC ACAATGGTGA AGCGCTTGGA AAAGGGTGGC TATGTGCAAC 
GCACGcATCG TCTTGGCTGT ACGTTAACCA GAAAGGGGGC AgTTTTTGGA TCTGCaGTGT 
TAAGGAAGCA TCGCTTGTTG GAGAGTTTTC TTTCCCAGGT ATTGTGTTTA GAAGCAGGGG 
TGGTGCACAA AGAAGCGGAA ATGCTTGAGC ATGCGTGTTC TGACGAGCTC ATCGACGTTA 
TTGATCGCTA TTTGCAGTAT CCTACGCGGG ATCCTCACGG GCAGCCGATC CCAAGAAAGG 
ATACGCTTTT GGATTTGTAT GTTGAGGACG ATGTGCCAGG TGTATGATCT TTTTGTATGG 
GGTGAGGATG CGCCTTTTGT CAGATAAAAG GGGATGTGCA AAACGTATTG TTGAGAGGAG 
AGGGCCATGA AGCTTGTGTT GATCCGTCAT GGAGAAAGTG AATGGAACAG GCTGAACCTG 
TTCACTGGTT GGACAGATGT TCCGCTTACC CCACGTGGGG AGTCGGAAGC CCAGGAAGGA 
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GGCCGCGTAC TGCAAGAAGC GGGGTTTGAT TTTGACCTAT GCTACACTTC TTTCTTGAAA 3540 

CGTGCCATTC GTACGCTCAA TTTTGTACTC CAGGCACTGG ACCGTGAGTG GTTGCCGGTT 3600 

CACAAAAGCT GGAAATTGAA CGAGCGGCAT tATGGGGATC TACAAGgTTT AAATAAGACA 3660 

GAGACGGCGC AGAAGTATGG TGAGCAGCAG GTTAGGGTGT GGCGTCGCTC CTTTGATGTG 3720 

GCTCCTCCTC CGCTTACTGT AGGGGACGCA CGTTGTCCGC ATACTCAAGC CTCCTACCGG 3780 

GGGGTATGCG CGTCTGGTCG GACGCCAtAC TTCCGTTTAC GGAAAGTTTG AAAGATACCG 3840 

TTGCGCGTGT GGTGCCGTAT TTTGAAGAGG AAATCAAACC GCAGATGATT TCCGGACAGC 3900 

GTGTGCTTAw TGTGGCGCAT GGTAACTCGT TGCGCGCACT GATGAAGCAC ATAGAGTCTT 39 60 

TGGATGAGAC TCAGATAATG GAAGTAAATT TGCCTACCGG TGTACCGCTT GTCTATGAGT 4020 

TCGAGGCGGA TTTTAC CCTG TGTGGGAAGC GTTTTTTAAG 4060 
(2) INFORMATION FOR SEQ ID NO: 167: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2074 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 167: 

CTTTTTACCC AACTCATTGC CAATGCGGTT ATGGAAGCGG GTAACGAGCA TGGTTTGAAA 60 

ATCATTGAGA ACTTGCAGGA TGAGGAGAGT GGCGATGAGC TTGACGAGTC CGTTTCCTTG 120 

CACGAGGAAG GGCGCGAAAT TACTGACTAT GAAAATTATA CTCCTCCTGA GGAGCGTGAG 180 

TATTCTGTGA ACGATGAAGG CGATGTGTTT GATGAGGATG AGTCGCTCTA CGAGGGGCGT 240 

TAGGTGTGCC CTCCGCGGTC TCTTTTGGGA TGTGTGTCTG GGTAGGCATG TTTGCATCGA 300 

AGTCTGATCG GAAAATGCTG TCAGGAGGGG TACATGGAGA TTGCTGCTCG CGACGTTAAG 360 

TCTTTGCGTG ATAAAACCGG GGCCGGGATG ATGGAtGTAA GCGTGCGCTC CAGGAGTGTG 420 

CAGGGGACGC TCTGTGTGCA GAAAAGTATC TTAAgGAGAr GGGGCTTGCT GCCATCGAAA 480 

ACAGGCGTGG GCGTGCCACT GCTGagGGAG TCaTCGTTAt TAAAGCACgG CaTGc AGAgG 540 

GCgCgGCCTG TgGGGCGAGC GCTGTAGCAA TGGTTGAGCT TGTTTGCGAA ACAGATTTTG 600 

TGGCAAAGAA CGCAGAGTTC ATCGCCCTTG CTGAGCGTAT AGCTCAGGCG GTGCTCGAGC 660 

ACGCGTACAC TGAGGTAAAC CAGGTnTGCG CGATATGGTG GTGGACCTCG CAACGCGCGT 720 

ACGGGAAAAT ATGAGCTTGn ACcGCCTTGC GCTCTTACGT GCCGcAGTGC CGGTGCAGGT 780 
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CAGTACCTTT CCTnTACGTG CACCCTGATA AAAAAACAGG GGTAGTGCTC TCCTTTTCCT 840 

CCGATGCGCC GGATGTGTTC CTGCGATCCG ATGTGCGGGC cTTCGCGTAT GACTGCTGTT 900 

TGCACGCGGC GGCATATACC CCTCGyTACG TGCGCGCAGA GGACGTGCCT GCTGAGTATG 960 

TGCGGGAGCA GCGTGAGGTG TTCCAAGCGC ATGTTGCGTC TCTCCAGAAG CtGCGCATGT 1020 

CAAGGAAAGT ATCGTGCAGG GTAAACTAGA GAAGCATTTG GCTGAGATCT GTTTTCTGAA 1080 

GCAGCCCTTT GTTAAGGACG wCAAGCTTTC TGTTGAAAAA AAGATGGCAG AAGTGGGTGC 1140 

CCGCGCAGGG GGTGCGCTTC GGTTTACTCA GGCACTGATA TACCAGCTAG GGGTACAGTG 1200 

AGTGGGAAGC ACGGATAGAT CCTGCCaCCC TGCAGGATGG GGAGCAAGCA GGCGTGGGGG 1260 
AGCTCGTGCT TCTCTCTTGC CGCACTGTGT TGTGAGGGGA AAAGATGGGT ATCGCTGAGT . 1320 

GCTATGAGCA GAAGATGAAG AAGTCCCTCT CAGCGCTGCA GGAGGGTTTT AACACGCTGC 1380 

GTACTGAACg TGCGACTGCA CATTTGCTTG ATCAGATTAC TGTCGACTAC TATCAGCAAC 1440 

CAACCGCGCT TAGTCaGGTG GCTACCGTTT CGTACCCGAG GCGCGTTTGA TCATTATCCA 1500 

GCCTTGGGAT AAAACGCTCC TTGCGGATAT CGAG CGTGC A ATTTTAAAGT CAAAATTGTC 1560 

GCTCAATCCC TCCAACGACG GCAAGGTTAT TCGTCTAGTG ATTCCTCCAC TTACCCAGGA 1620 

GCGAAGGAAG GAGCTTGTCA GGCAGGCGCG CGCGTTAgCC GAGCAGGCGC GCGTTGCTAT 1680 

TCGCAATATT CGCCGTGAGG GAATCGAGGA AGCAAAGCGC GGGCATAAGG AGGGACTGCT 1740 

AAGCGAGGAT GCACTGAAAG CAGCAGAAGA GGCCTTCCAA AAAGCGACTG ACGCTTCTGT 1800 

CGCAGAgTtG CACGGTACTT GGCCGAGAAG GAAAAGGATA TCCTGGAAGG TTGAGTGCCG 1860 

TGCAGCACGT GGCCATCATC ATGGATGGAA ACGGGAGATG GGCGGAAAGG AGAGGGTTGC 1920 

GGCGCAGTGC AGGGCACCGG CGGGGGCTGC AGACAGCGCG AGAGATTGTC GCGGCGCTGT 1980 

GCCGATTCGG GTGCCTTTTG TTACTCTGTA TGTGTTTTCT ACTGAAAACT GGAAGCGCTC 2040 

TGCnATGAAG TGCATTTCTT GATGAATTTA ATCA 2074 
(2) INFORMATION FOR SEQ ID NO: 168: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2685 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 168: 
CGCCGGGnTA TTCCTTGCGT ATGCATCGAT GATCATTTGC ACGTTTTGCC GACCTAAAAT 60 
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ATCCGCTGCA TTGGTACGGA TTACCTGAGT CAGGTGCGTG GCGATGATTG CAGATGGAGC 120 

AACTACAGTG TAmCCGACAC GCTCAGCACG ATCGCGATTT TCTTCAGAAA TCCACACAGC 180 

AGGGAGTCCA AATGTAGGAT CAATTGTTCG CTCCCCTGGA ACCTCCTCTG TAACTTGACC 240 

AGACTTTATG GCGAGAAACC ATCCCAAGCG CAnTTTyCCC CGCGCAACTT CTAATCCTTG 300 

GATTTTGAAA CAATAACTGC TCGGATCTAA ACGCATATTG TCAATAATTC GAATTTTAGG 360 

AGCAACCAAT CCGAGATCCA ACGCGGCATC TTTTCTAATA ACTGTAATTC TACTTAGGAG 420 

CTCTGCACCC TTTTCTTTAT CAACAAGAGG AATTAATCCA TACCCAAGTT CTAGCGAAAG 480 

TGGATCAAGC GGCACAATAG GACCCATTTC AGAAGTACTA TCTTGAGTCT GTTGCATACC 540 

CTTTTTATCT GAACTTTTTT GCATCTCATG TTCTTGAACG TGTACCCGTT CCCTTTTCCT 600 
TAGCTGCAGT CCCACAAAGG CAAAACACAC GGCCATAAAA AATAAAATAC TGTGGGGAAA 660 

ACCCGGCAAT ACCGCCATAA CGATCAATGC ACCTGAGCCA ATAAAATAAA CAAGTGCACT 720 

TTTTGAAAAT TGTTCCTGTA CGTTTTGACC AAATGACCCT TGATCGCTTG ATCGAGTGAC 780 
AATAAAACCT GTTGCAACAG ACAACAACAA AGAAGGAAGC TGTGCAAGCA ACCCATCTCC 840 
TATCGTTAAA TTTGTATAGG TCTGCAATGC TGCCTGAAAA CCCTCCCTAC GAAATATGAC 900 
ACCCACTATC AGGCCTGCAA TCACATTTAC AATGGTAATA AAAATACCAA TTTTGACATT 960 

GCCCGATACG AACTTACTCG CTCCATCCAT TGcTCCAAAA AAATCTGcTT CACGCTGaAT 1020 

TTGCCTcTtA yrCTcTCGCG cTTcTTCwTC GGTGATAACA CCTGnCATTA TATTCAGCAT 1080 

CAATAGACAT GcTTTTGGTT GCATTGAAGT CTAAGGTAAA ACGCGcAnAA ACTTCTGcAA 1140 

TACGCGTCGC ACCCTTAGTA ATAACAAAAG CTTGCACTGC AATTAAAATG ATGAATACCG 1200 

TAAAACCAAT TACTAGACCT TGCGTCCCGG ATCCTCCCAC CACAAAAGAA CTAAACGCAC 1260 

GGrTCATATA CCCGCTAAAC CGATCTCCTA ACGTTAAAAT CAACCGGGTG GAAGACACGT 1320 

TCAGTCCAAG TCCAAAAACG GTTGAGGTCA ACAAGAGCGA GGGrAATACA GrAAAATCTG 1380 

TTGGTTTTTC AACAAATAAC ACCATAAGTA ATATCAAAAG GTTAAAGATA AGATTAAaGG 1440 

CCwTCAACGC aTCGAGAATT TGCGTGGGCA GAGGAACAmC AATAGAAAaG ACAmCCACCA 1500 

ACACTGAAAT CGCAACAAAA GCGTCAGTAG TGAAAAgGCA CTCTTACCGT GCGCCATAGT 1560 

ATGGGTTACC TCTTACGCTG TGCATGTGTT TTAAATTTAT CCAGCTTGGT AAAAATCAGC 1620 

ACTAAgCATT AAAATATTCG TAGGGAACTT CTCTCCCGAT AGCAAC t GCG TGTACAACGC 1680 

ACGTGCAAGC GGTTTGTTTT CTTCTATCAA GATACCTGCC TCTTTTGCCA ACCGTTTGAT 1740 

TCGGTATGCA GTCCCATCAG ATCCTTTCGC AACCACAGTC GGCGCAGTCA TGTATGCAGG 1800 
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CTCATATTGC ACCGCAcTGC AAAATGAGTC GGATTAGTGA TCACAACATC AGCGTCAGTG 1860 

GTATTC CGAG CAGACTCTCT AACAAGAGAT TGCATCTGCT TTCTAACATA ACTTCTCACG 1920 

AGCGGGTCCC CTTCCTGCTC TTTTAACTCC TCTTTCACTT CCTGCCGAGA CATTTTTAAC 1980 

GAATCGATGA ATTGCCTTCT TTGGAAGAAA TAATCGGGAA GCGAGAACAC CACTAACAGC 2040 

AAACTTACTT CGAGGAGAAC TTTACCCGCA AGGGATGTAA TGTAGAAAAT ACTCTGGGTA 2100 

AGACTCACAC CCAATAAgAA ACAAACATAA AAAGATCATT ACGTATAGTA AAATACGATA 2160 

CAAAAAaTAT CGCTGTAaTC TTTATGAGAG ATTTAAGTAA ATTGAAAAGC CCTTCTGTTG 2220 

AAAAAAATGA GCGTTTGAAA AAACGAATTA CATCTGGAGA TATTTTCTTA AACTGCGGTC 2280 

GAATCGACTT TACCGAAAAT AAAACGGTCT TGTTTTGTAC AATGTTTGCC GCAACGCCAG 2340 

AGACCAGCGC AACAAAGGAT ATCGGAAGTG CAAGTTTCAT AAAATACCGC ACAAATACAA 2400 

AAAACCATCC AGTATTCTGG ATGGACGCGG TAGTAGCACG CGTAAAGAAA AACCTGAGTA 2460 

CACCGATGCA CTCTCTCAAT ATAAATGGTG CAAGCAAGAA CAAGGAAGTT GATGTGAAGA 2520 

GCATCACAAA CGCTCCATTT AGATCCgGCT TTc GGAACAC GTCCTTCTTC TCGTGCTTAC 2580 

GGAGTTTTGt TcGGTAGaTC CTCTGaCCTC CCTTCaTCCT CAGCGGCAAA CCACTGGCAA 2640 

ATCAATAATA AAAAGAGGAA GCGGAAATGT TCCTTCTTGT TCTAT 2685 
(2) INFORMATION FOR SEQ ID NO: 169: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 634 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 169: 

AATCTTGCAC GCCCTCGCTC AAGCCTTCGT CACTGGTCCC GCCACCAAAC GTTTCGGCGA 60 

GAAGCAGGCC ATTATCGCCG GCATGGGGnC CGACGCGCTG GGCTACGTCT TGCTGGCGTT 120 

CGCGACGCGA GGCTGGATGG CCTTCCCCAT TATGATTCCC AATAAAAATT GCGCGTGCCG 180 

CACACCGTAA TCAGTTTACT GAAGAAGTGC TCCAATCCCT GCGGATTTGC CACCAATGCG 240 

GTCTGTGTTC TGCCGCCTGT ACTGCGCGTA TTCCTCTTGC AAAACTTTTG CACGATGCAC 300 

AAGAACGCGC ACTGCATCTT TCCCGTGCTC CAGTCACCAA AATAGAACCC CACTCCACAC 360 

AAAGCGTCGG GAAAACTATC CGCGAGCACC TGCCAATGCG CACCGCTGGA GTACAAACAC 420 

GCACCCTTCC TTTACACCGG CTTAAGTGCT GGACAGAACA ACAGTGTACT GTTGGCGCTG 480 
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CTTGTTGCGC ACGTGTTCGT CGTTGCAGCC ATGCGCGACA CGGTCGnTnT TTTTCCATCG 
TCAGTACCGA ACTCGGCGCA CTGAGCGCGC GCTCGTTCAA ACACTACGCA CACCACATGT 
GCCCCTGAGC GACTCTCTCG TACTGGGCCT GCTC 
(2) INFORMATION FOR SEQ ID NO: 170: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4042 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
<D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 170: 
CTGAACACGT TGCACGGTGC GACGCTCCTT ACAGGAATGr AAAAAAAAGA ACgCCGCGCA 
CAkTGCACCG CCCACATATG ACGCACACCG CCTAGTTTTC ATCACGAAGA CcTTCTGAAG 
AAAAACACGC GCGGTGACAC GCTTCGAGCT GTCCATTTTC CACGTGTGTG TACACCTGGG 
TAGTTGCAAT ATCTGCATGG CCGAGCAGAC ACTGAACCGA ATGTAAATCG ACACCGCCTG 
CAAGCAAATG CGTCGCATAT GAATGGCGAA ACGTATGAAC GTGCGTCTCC ACCCCTGAGC 
GGATTTCAAT ATCCTGCAGG CGCTTCCATA TACCTTTACG ACTCAGACGA CCCCCCTGCC 
TGTTTAAAAA GACAGCTCCT TTTTTTTCAC TATTCTCCGG ATGCTTCGCG CTCGTAAAAC 
GAACACGCGC CTCACGTATA TACTGCaTGA GAAAGTAACA CGCCTGTTCT CCAAAGGGAG 
CCATACGCTC TTTGTTCCCC TTACCTGTCA CTTTTAGCAG ACGTTCTGCA AAAAAAATGT 
CAGAAAGGGA AAGAGAAACT GCCTCACTGA CCCGCAAACC TGCAGCATAA ATAAGTTCAA 
AGAGCGCGCG ATCACGCACG CCACCTGGTG TGCACAGTGG AATGGATTGC AGAAAAGTGT 
TGACCTGTTC GGGAGACAGC ACGCGCGGCA GTGAATACGC ACGACGCGGT GCATCAACGT 
CTTGCATtGG GTTATCAGGA CGCACCTGCT CCAAAaTGAG AAACTGATAG AACGCGTGCA 
GGCCGCCATG TCCTTcGCAA TCGTTTTGCC TGAGACACCC GCAGCACTCC GCTTTTCGAT 
AAAACGTACA CAATCGTGCG CATTGGCGCT TTCAATGTTA CACGGCGGAT CAAGGTTTTT 
CTGAAAAAGC ATCAACGTAG TCACGTACGT CCGTGCGGTC AAAAGCGCGT GCGCCTCTGC 
CGAAATTAAG TACGCGTAAA AGGATTGCAC ACGGATATCG ACGTCTTTCA CAGACCCTTG 
TTCCTATTTC TCAAAAAGGT CGGTTCATCC AAATCTTCAT TTACCGAGGG GAGCGGAACG 
CCAAACGTGT GGCCCTtCAC TCCGTTCTTT TCCATACGCG TTTCTTGAAC AGCGCTATTA 
CGCGTAGCCA GACCAGGTAA GTTTGGCTGT TTAGAGCTTT TAGCTCTGTT CCATTCGTCA 



540 
600 
634 
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GAACTAATAT 


ACACACCGGT 


ACTCACCGCA 


CCGTACGATG 


ATGTTTTTAT 


CTTTTGAGAA 


1260 


CTGTG TGTAT 


CTCCCGCTAT 


CGAAATACTC 


GCTTGCGGTA 


CACCCGTTGC 


GATAACCGTA 


1320 


ACCCTCACCC 


TATCCTGCAT 


ACTCGCGTCG 


ATGGACGTGC 


CATGGATGAT 


AATCGCATCC 


1380 


GGATCAATGG 


TCTTTGCAAC 


CACAGACATC 


ACGCCATCGA 


CTTCTCCCAT 


GCTCAAGTTC 


1440 


TCTGAGCCAC 


GTACCGCAAC 


CAGCAGTCTG 


GTAGCACCTT 


CTATCCGCGT 


CTCTTCCAAA 


1500 


AGCGGATTAT 


TAATTGCAGC 


GGTTGCCGCA 


TCTACTGCGC 


GGTTTTCCCC 


TTCTCCCTCT 


1560 


CCCACACCGA 


TAAGCGCGTA 


CCCCTGCCCT 


TCCATGGTGT 


TTTTTACATC 


CATGAAATCT 


1620 


AAATTCACTT 


CTCCAGGAAG 


GGTAATTAAA 


TCAGAAATAC 


TTTGCACCGA CTTGCGCAgc 


1680 


aGATnCATCT 


GCAACCAGAT 


ACGTCTCTTT 


AATCGGGCAG 


CGCTTATCTA 


CCACACTGAG 


1740 


TAAATTCTGA 


TTAGGGATCA 


CAATCACGGT 


GTCCGAGTGC 


GTGCGCAATT 


TTTCGATCCC 


1800 


TCGCTCAGCG 


AGCATCATCT 


TTGCTCTGCC 


TTCAAAGCGA 


AACGGCTTCG 


TGACTACGGC 


1860 


AACTGTCAAA 


GCACCAAGTT 


CCCGTGCAAT 


CTTTGCAATA 


ACTGGGGCAG 


CACCTGTTCC 


1920 


CGTACCTCCT 


CCCATCCcTG 


CGGTGATGAA 


CACCATGTTC 


GCGCCCTGCA 


ACGCACTTGC 


1980 


AATGGCTTCA 


GCATCTTCCA 


TTGCAGCCTT 


CTCGCCAATC 


TCAGGATCAC 


CGCCTGCACC 


2040 


CAACCCCCTT 


GTCACCTTGG 


TGCCAATGGC 


AAGCTTTTTA 


GGCGCGGTAG 


AATAGCTCAA 


2100 


CGCCTGcACA 


TCTGTATTTG 


CTGCAATAAA 


CTCGACGCAC 


TGCAAACCGC 


AGCTCATCAT 


2160 


CCTATTTACC 


GCGTTTGACC 


CACCACCACC 


GGCACCGATG 


ACCTTTATGA 


CCGTTGGACT 


2220 


TAGGGTAAAC 


TCTTCGCCTG 


AAGGTGCAAG 


CTCTATATTC 


ATCATTCCCC 


TCCCATTCCC 


2280 


CTACACCGCG 


TGCCGCATGC 


TGTGCGCGGT 


TTAAAACAAG 


TTCCTCCAAA 


TATCCTTCAC 


2340 


TTTAGTAAAC 


ACTCCCGCAC 


GCTCCATCTC 


AGCACGGCCC 


TGATAAGCGC 


GCTGTCCCTG 


2400 


CTTATGGGTA 


TATTCTAAAA 


TCAGTCCTAA 


CACCACTGCA 


AACTCAGGAC 


TGCGATATTC 


2460 


CCCTGCCAAT 


CCTCCCAAAG 


TACCTGGTAT 


TCCAAGGTGC 


ACGCGCGGTG 


TATCAAAAAT 


2520 


TGcTGACGCA 


AGCTCTACCG 


CACCGGTAAG 


CTGCGCGCCA 


CCGCCGCAGA 


GAATAATATT 


2580 


TTCAATGATA 


CCACGACCGC 


TTTGcGTCTC 


CACCGTCGAA 


AGACGATCGC 


GCACTATCGT 


2640 


AAAAACCTCA 


CACATGCGCG 


CTTCAATTAT 


TTCGGCGATT 


TCTCGTTTAG 


AAATTTCTAC 


2700 


AGGAATCCGA 


TTTCCCTGGC 


TGGAGATGAG 


AACACTCCCT 


TCTCCCTCAA 


GCAGGGGGAT 


2760 


CCAGCAACAT 


CCATCTTTAA 


TTTTAATGCG 


CTCTGCAGTT 


TCAAGCGGGA 


GGTTTTTTAC 


2820 


CTTTGCAAGA 


TCAGAAGTTA 


CCTGACTGCC 


CCCAACAGGA 


ATCGAAGTGA 


TAAGCACCGG 


2880 


GGAACCCTTG 


TACATTGCAA 


TAACATCCGT 


AGTTCCCCCA 


CCAATATTAA 


TGAGCACACA 


2940 
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CCCTACATTT CGCTCGTCAT CGTTTAACAC AGAACGAACA GCAGCGAGCC CGTTATGCAT 
TAAAAAATCG ATGTGCAAGT TCGCcCGTTc ACGCAATCGA TTACACTGCG CATACACGTT 
GCAGAACCGG TGATCATATG CACyCTcTTC CAGGCGAACC CCAATGATAT TGCGCGGATC 
GGTGATGCCG TGCTGATCAT CCACCGAATA AACTTTGGGA ATAACATGAA GAATTTTACG 
ATCGGGAGGA AGAGAAACTG CACAGGCAAC TTCAaGCACC cGATCAATAT CGCTTTGATC 
AACTTCGCGA TGCCCcTTCC CcTTATCTGC AACCGCCACA ACACcTTTTA AATTTCTACC 
CTCGATGTGG GTACcTCCAa GCCCCACAAg CAGTGCGCAA tTCGATACCG GACATCATCT 
CCGCAGCTTC AACCGCGTGG TGGATACCCA CAACTGTATT CTCAATATTG ACTACTACAC 
CCCGCCTCAA ACCCTTTGAA TGACCGACGC CTACACCTAC AACCTGTAAC GCACCACCTT 
CCAACCGCTC GGCCACTACC GCCCTGATCG ATTCGGTACC GATATCTAAG CCGACAATAA 
CCTCACCCAT AACTTTCCTC TTCTAGCGAT ACACCGCCGT CCCACCTCTC ACGTCAAGCT 
CCTTAATGCG CCTCTGCGTT TGCCACTCCC GCAGCGCATC AACGAGCAAT ATG AC AT AC C 
GCAACTTTTC TTCGCTAAGG TTTTTGTCCA TGCGTACTCT GATAGGTGCA CGCACCAGGT 
AAAGCGCTAA ATCATATCCC CCGTGTCTTT TTTGTTCAAT GCTTATCTCA GAAATTTCAC 
CGAGCAAAAG AGGG TTCCGC TTGCTCAAAT TATCTAACTG CACAAAAAGA GGAACAAGCT 
GATCGTGCAC GCGTAGCCCC ACGCGCGGAT TACGAAATTC AAGACCACTT AC T AC CGGC A 
ATACTGTATC AAGAGGTGCC GTCCCAACAC TAAAAACTGT TCCTGTCTTG TCAATCTGCA 
CCGGCATCGC ACGGCC TTGA ACGTGCACAA AACCAAGTGC AATAGCAACA CGCTCTACCA 

CATGAACATG catcgtatct gg 
(2) INFORMATION FOR SEQ ID NO: 171: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 484 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 171: 
TTGCCTTTCA AATAAATAAG TATTTTTTTA AAAGGnGGGA GGGGGTTTAT ACTTCACAAT 
ACTATTATAG GACATGGTGm TTTATCCCTT GCGTAAAGAA CCTGGAAGTG TTAATGCAAC 
TTGCAAAAAA AGAGGCATTG GTTACTTGGA GTCACAGGGA GTCAAAGGAA GGTATTAACT 
CCCTTTTTTT GTTCTGTTTT CGTTTGTTTT CAAACAAGTA ACTGGCCATG ATGATACAAA 



3000 
3060 
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3300 
3360 
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CACATACACA ATGACTTGTT AAATTACCTT TTCAAACAAA AAAAGTTATT TCACCCCTGA 
GACCAACCAC CCATGAAACA AGGGAGAAAG ACAGAACCAA GTTAGAGAAG CCCCAAAAGC 
AACATGCTGC ATTGCTCCAA AGACTGCCAG GTTCCCTTGc AAATAAAGTA CTTGCAACAc 
CCCCCTTGAG CTATGTGGcT CTGTGTGTGT TTAcTAGCAA AGCCAGTCTT TGrAATCTTG 
AAAC 

(2) INFORMATION FOR SEQ ID NO: 172: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3134 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 172: 
TCCTTTACAC GGTATTGCTC CACCCTGAGG GCATCGAACG CGCTGTCAAA CTTCTCCCGT 
GAGCTCCCCG TTGCCAGAAG GCGATT AC t G CATCGGCAGG GTCCTGGTGT TGGTTACCGG 
CGTCGAGGGC GAAGGAGAAG CGCCGCACAT ACCCCTGCAA TCCCTCGCCT GTAAACTGAG 
CGCGCACAGC CCCACGGGGA AAACGCGCAC CCCCACCGTA CCCGTTCTCT GGCCCACACG 
ACAACCCCTC ACCAAACCAC CTTCACCCCA CAGATAAACG TGCCAAAGTA ACGCTCAGAC 
CACATATCTC GGCAtACCCA TGTaAGGAGC GTCAGCAAGC ACCCCCTGTT CCCACTGGGC 
GCTGAGCTCC ACTATCAGTG GCAACTTCGT CAGCGTAGAG ACGAGCGTAC CGTCCTGCAG 
CGTGAGGGCA AGCCCCACCC GCTCGAGAAT CGAGGACTCC GCCATCTGTC CCAGCAAAAA 
TTCAAGATAC TTATCCTTTA CACGGTATTG CTCCACCcTG AGGGCATCGA ACGCGCTGTC 
AAACTTCTCC CGTGAGCTCC CCGTTGCCAG AAGGCGATTA CtGCATCGGC AGGGTCCTGG 
TGTTGGTTAC CGGCGTCGAG GGCGAAGGAG AAGCGGAAGC CGGCGCCTGG TTCGAGGGTG 
AGTCGGCCTC CTACTCCCCA CAGGAGTGCT GTTTTGTTTT CGTTCGTGGA GTCTTcGGTA 
CCCTTACGGT AGTGCTGCTC CAGTGTGGCA TTCCCTGCCA GCTCCAACGT AAGCAGCCGC 
TGACGGTCGA CGCCATAGGA AAGCGTTGCA TCGGCCCCGA AGCCATACTT GCTGTGCGTG 
GTGTCAGTAC TATCCCAGGC ACCATTGGAA AGGAAGGAGA GGAAACCGAT GTCCACATCT 
ACTCCGCTGT TTCCCACATT GTGGGCCTGG TAGCCGAGTT TTGCCCCGGA GCCGGAGAAA 
CCAGGGGCAT AGCGAGTGTC CTTTTCTGAA TAGGCACGGG TGACAAAGGG TTTCCACAGC 
TGGGCAAAGT TAACCACACA GGAAGGACTG GTACCCACTG TCAGGTAGGC CCCATAACAG 
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TGCAGGGTTG CCTGGAAGGA AGCGGTAGGT TTGGTAAAGG ACAGGGCCGT TGAGCTTTTA 
GAAGACGCAA GCTCTACTGC CAGGTCCTTC AGCTGCAGCT GTGCCCACAC CCCTGAGCGT 
GCCTCCCCTC GGcGGGTGTG GGTGTGCTTT GACACCAACG GcAGGGAAAT AGTCAGACTA 
TTGGTAGTGC GAAACCCATG GGTGTGCTTG CCCGGGCCAG TGCGTGGATT CTTCTGGAAC 
GCAATGCCCC ACTGGAGCTG GGCTGTGCCA CTGACCTGCG GAGTGAGTAC GCCTGCATAA 
CCAGAAGCAG CACATACCAT GCCCGCAAGT ACCCCcGCTT GCATCACCTG CCTGmCCACT 
CACTCCCCCT CCTCTcACTT CTAmCTCACC CCCCCCCCAC CCGTCTAGAA GACACGGAGA 
GCTCTATCTC ATGAGCACCT ACACACTCGC CTTCTCTTGG GGGACAGACA GAACTTCCGA 
AGAGAGAACA ATAGGTTCCG GCGATGTTTC GAATAGGGTA AGGCGTTCTG ACGCCTcCTC 
TAATGACTGC GCCAGACGCG CTACCGTGGC TTTGCTTAAC GCAACGTtGC TgCGCCGATT 
CGATCATCGC CTCGTACTCT TTGATCCCCC GTCGATATnC CnCGGGGGGG GGGGGGGnGn 
nnnnGCACAC ACGCTGGGCn GACTGAAGCT TTTGACGGAG CTTGGCGCAC TCTACCTGAT 
ACCCTGCACG TGCCACGCGC ATCCGCGCTT CTGCCACGGT ACGCAACAGC GTGCTGAGTT 
CATCGGGCAC CGTCGGTCTT GTCTTCTGAG ATAAAGGAGG CTGAGCAAAT CCTTCCTCGG 
TAAAGAAACA ACTTGCGTAT GCAGCACCGA TGCGCGCGCA CAGCCATTGA TATCCTCATC 
CAGTTCACTG ACC TGCGCTG TGAACGCCGC AACGCGCTTC AGCGGCTTGG CACCACAGTC 
AAGGGcGCGC AGgCGgCGTC GAgTGCTTCT TGCTCATCCA TAAGCTCcTG GCTATGTTGA 
AGGTTGCTCG CGTAAACGCC GCGGTCGGAT ATGAGCCGTG CGTACGCGGC ACTGAGTGCG 
GAGGAAAGCT CGCCTGCGTG ATACATGCGC TCGACGTCCG GATGCGCAAT GACATCCGGG 
GTACAGAGCG TAATAATCTT TTGAATTTTT GCTTCGAGCA CGCGAATCCT GCGCTGAAGA 
ACCGCGCTCT TCGCCTGGAG GCCAACTCTT TCAAGAATTG AACCAAATGT GCATGTTTCC 
AAGAGCTGGT CACGCTTGGC GCGTAAATCC TGCAGAGTAG ACTCAAGCTC CGCCGTACGC 
GCATAGATCG GTTCGAGCGC AGGTAGGCCT ACGTGCGCGT AGGTGGCATA GTACTGGGCT 
ACAAAACTCC TGAGTACATC CCGCTCCTGA CGAGCATGGC GGTGCAATAC TTTGCTCACC 
CGCTTTCCAA GAGCGGCAAG TTCTTCTTGT CTTTGAAGTA TCGACTTAAT ATCAAGGATA 
GACTCAGCAA CCTGATCGCG CTGACGTTGA AGCGCGTGGC ATCGGCTGAT GTCAGTGTCC 
TGTACGCCAA GTCCGCTGAT GTCACACGCA GCACCGCCGC GCACAATATG TTCACCGAGA 
CTGCAACAAT GGCTCTGCAG ATCCTGCTGC GCACGCTGAC ACGCGGCATT CAGCGCGGAA 
AGACTCTTAT CCGCGAACAT GACCGCATTG TAAACACTTC CCCTGCGTAT GTACAGGGCA 



1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 
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1980 

2040 

2100 

2160 
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2280 

2340 

2400 
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CCTCACTCCC TCTTTACCCA TGCAAGGACA GACGTCC TGC CTAACTCTGG CGGGGTACTA 2 880 

CCTGCGCGCT CGTATTCGGG TACAGATTAT ACGCCCGATC CATTGCGTTA GGCAAcCTGC 2940 

GCCAGGATTG CAAGCACTGC CATTGTCATC ACCTCTGGTT GGAAGCGCTG ATACCCGTAG 3000 

CGGATCGCTA CGTCTcCTAA ACCTcCTcCC CCAATCGCCC TGCCATTGCT GAGTATCCAA 3060 

TTAGATTAAT TACCATCAGT GCTACACCAG AAAnCACATG CAGGAGCAGC CTCAGGCAGC 3120 

ATGATTTTTA GTAC 3134 
(2) INFORMATION FOR SEQ ID NO: 173: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 635 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 173: 



ATTnTTTAAT 


nATAGGATCC 


CGTCACGTCA 


TACACAACTT 


TCTCCGCATA 


AAGGCACTCA 


60 


TCTAAGTTAA 


GCTGAAGCCC 


CTCATACAGG 


ACATCAATAG 


AACGTGCTAG 


GTGATAAATA 


120 


TTGTCTTGAA 


ACACAAAATT 


CTGCACCACA 


GACCTCCCCA 


CACATACGTG 


ACCGACGCGG 


180 


GCACACAGGA 


AAGCGCACCT 


TCTGCGAACA 


GGTTCCTCAC 


CTCGTCTGCC 


GTAAGAGCAC 


240 


ACGCACGAAT 


GTACCCCCTC 


AGACGACGCT 


GCGCCGTCGG 


CAGCATCGGA 


TAAAAGGACA 


300 


AAAGACTTGA 


TGCGGCACAC 


AGATTTACGC 


TACGCTGGCG 


TCTCGCCGCG 


CGTTCTGCGC 


360 


ACGCTACTCC 


TCCTTGTCGG 


GTGAGTCTAG 


TGCAGCCTGA 


TAAGGTGAGC 


CGTCCATAGG 


420 


AGTTATTATG 


AGGACCTACG 


AGnTAATGGC 


CGTTTTCAGT 


GCACACGAGG 


ATCTCTTTCT 


480 


TCAGGGTTCC 


ACCGCCGTTC 


GTGCCCTCCT 


ACAGGAAAAC 


GACGCATCAT 


nCGCCCGCGA 


540 


AGACCATATT 


GGAGAnCGGG 


AACTTGCGTA 


TCCTCTGAAG 


AAGCAAAAGA 


GGGGCCGTTA 


600 


nCTGCTCTTC 


ATTGTTCAGT 


GTTGAGCCnG 


GGAAA 






635 


<2) INFORMATION FOR SEQ ID NO: 174: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1644 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 174: 
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CGGTACCTTC GGTTCTTGGG CAGGGCTTAC TGCAGCCGAG CAGCTCGTCT TATTCAGCCG 
GTAACTGGCA CCGCCCACGT TCATTGCTTG GCGTGCTAAG TGCACTGCCA AGGAGGTAGG 
CGCCATACAC GAAGAGTCGG CGTATTAAAG GGGTCTGTCA GAACTATGCG GTGCCGGTGC 
AGCTGGGGGT ACAGCACTAT TTTAGCGCGC ACTGGGGGAT AGACGCGACG GCTACCGTTT 
CGTTTGGCAT TGACACCAAG CTGGCTAAGT TCCGCATCCC GTATACGTTG CGCTTTGGCC 
CCGTCTTCCG CACCTAGGGG ACGGCGCTGG GArrAAAgAG TCCTGCCGGA AGGccCTGCG 
GCGGGTAGTA GCTACCAGGA GAGGGTGACG CCGCACACGA TGCGGCCGAT TCCCTGGGTG 
AGGCACTCGG ACACCAGCAG GTACGGGACA TCAGAGAGCA TACCCTGTTC CCAATCAAGG 
GAGAATACCG TCTTCTCTAT GAGACTGGCT GAAATACCAG CACGCAGCTG TGCACAGTAC 
TCCTTGGTTA GATAGGTAGC TCCTACTGCT CCACCTGCAG CAGGGGCATT CAGGTGTGCA 
CGGTTGGTAG AGGCATGGAC CGTAACGCTT GGCTTCACCC AGCCGTAATC CTGCACCGGG 
ATGCGATAGC TACACCACGC CTTCCCCACC ACCGGTGGAC GGATATACTC CTTTTCCTGA 
ATGCCACGCA CAGCCGTCCC CCCGTTATTT TTGTATAGCG CATAGGTGAG GGGGATGTAC 
ACGCGTGTTT CAACGCCGGC GTCCAGGCCG GTGAGCAGGT GGGTGTAGGG GTCACCGCTC 
TTAGTTTCGA GCTTAAGGAA TCCGGCAAAG TCGCCACAGC TTGCGATGGT GTTATCTAAC 
AC CCTGGTGC CAAAAACGTT TGCCGGTGCT GTGGCAAAGT ATATGCCAGA AGACAGCCAC 
TTCCACTGCG CCGTAAACAG CGCATCGAAG GCGACATTGT AGGTGTCAAG ATACAGACAC 
ACGGCGCTGA CTCCCATTAG AAAGGCACGC CATGCAGACG CACGCAGGTT CTGTATAGCC 
TGACGTATCT GCTGCCCCGC GTCCAACGCA TCCGTCTTCT TCTTCACTTC TTCGGTTACA 
AACGTCTGAC CCTCAGTGAA AAACTTTGTA GCCTCAGCCG TAAGCTTTGG AATAAGATCG 
GCGAGATCGT CCTTCAGTAA TTTTTGTTGA TCCGGAAGGT TGAAGAGGCT CATCGGATTT 
CCCTTATGCG GTGCAAGCAA CGCATCAGCC ACGTCTTCTA TTGCCTTGGT AAGACCGTTG 
ATGAATGCCT GTGCTGCAGC CTTGCTCTGA GCCACAGCCG CTTGCACTTT CTGGTTAATT 
TCAGCAACGA TTTGCGTCTG TACCTGCTCA AACCCCTTCA CCACCTGcTC CGCATCGTAC 
TGCAGCAAAA CCTGCCCCAT CAGGGAAAAT GCAGGAAGCG CGGGAAGCGG TGACGGTGTA 
GGAGGGTCGG TAATGAACAC CTTCAGGTTC GTCAAAAGCG TGGCAGCTAC CGGAGGGGTG 
ACTTTTACGT TTGACACTGC CGTTATGTAC GCGTTTCCCC TCGACTGAAG GGCGGTTTCs 
GTGGyksCgT TACGGCATCA AGCA 
(2) INFORMATION FOR SEQ ID NO: 175: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2535 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 175: 

CACGTACCGT CTCTGCAACC GTAGAAGCCT TCAGAGAAgG ACAACCCGAT CAAgTGTAAC 60 

TGATC TAG AT ACGCAGCTTC TGCAAgGCAg TTTGCACCAA CGCTTCTGCA CGCGCATTCA 120 

cTGAcTGCAT CCCATCCGCG TAACGCGCCG CAAtGCGCGC TTCACCACAC CCGGCAGAGA 180 

ACCGGTATTC ACCCCAATAC GCAAAGCAGT GCCGGTTGCT CGAGCCTTTT CTACCACCGC 240 

ACGCACGCGC TCACGCACAC CGATATTCCC CGGGTTAATA CGCACTTTCG CTACCGGTCC 300 

GTCCATACAC CGCAGCGCAA GCCGGTAATC AAAGTGAATG TCTGCAACAA GCGGCATACG 360 

CGTGCGCGcG CAAAGCsCcA CAAAAAGCTC AGCCGATTCC CTGTCAGGAA CGGCAAAACG 420 

CACAACGTCA CACCCTAATT GTTCAAGTTC CAAAAGACGG TCCACAATGG ATTGGAGATC 480 

TGCACCAATG AGCGGTTCTT TCCACATTGT TTGGATAGGA ATCGGTGCGT CACCCCCAAG 540 

AGGCAACGCA CGCACATGTT CCTTTCCTCC AATTACCAAT GATCGTGCAC GGTAGTGGGA 600 

CGCACGCGGT CTTAATTTTA ATGGAACATC CAAAAGTCCA CTCCCATACG GACTATTGCA 660 

GGGAGACACA CCGGCAGATG AATCCACCTT TTCCTCTGGc tGCCGCGCAg CACGCTCGTC 720 

CCTCTGATTC ACCGGGCCAC CACTTGCGCA CGCACACTTT GCGCAACTTC CACTCCACAG 780 

AGTGCTTCGA TAACTGCGAA AGAAAACTCT TCTGCCGCAC CTGCCGCACA CGCGGTCAAA 840 

AGATTTCCAT CACGTACCAC CCGCGCGCGC TCAGGTTTAC GAAGCGCACG CGATTTCTCC 900 

TCTTCCGTCC TTTTCCCAAC CCCATCGTCA TGTGCGGAGA ACACCGCCGG CTCCATACCC 960 

GGATAGCAGG TATAGCGACG TGACCCCAGG AGATTCCACG CAGAGAGCAC TCGCGCCGGG 1020 

GCAGCACACA GCGCAGCGAC GAGTCCTCCG CGCAGGTGCA CGCGCATGAC AAAATCACGC 1080 

ACCGCCGCAC AGGCGGCAAG CGTATGACAG TTTTCCAACC CCCCGGGAAG AAGAACCGCA 1140 

TCTGCGGCGC ACGCGGCATC CGCGATGCCC GGAGAAGCAC AAAGCGCCTC AAGACTGCAG 1200 

TCGCAGCTCA CACGCAAGCC ACGCGTAGAA ACAACCTGCT CTGCCCCAAC ACCCACGAGC 1260 

GTTAACGCTA TCCCCGCACG TCTGAGATAA TCCAACGGGG TGATAGTCTC CACTTCCTCA 1320 

AATCCGTGTG CAACAAAAAG GTATACCCGT ACGCTCACCG CAACACCCCC GTACCTGTTC 1380 

TTACAGCAAC AGTGCAACCC CGCGCACACA GACTACGGAT AATCGTCCTC CGTTCAAAAG 1440 
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CGACGTACTA GCCGTGCGCC CCGCGAGCGC AgCACTACCT TAGGATAGAA GAACTAAAAA 1500 

CCACGCGGAG CCTGTGCCGG ATCTATTGGT TTATTTTTTT TATATACCAT AAAGTACAAT 1560 

CGTGATCGCG CCGCTGCTGC GTCAAACCCC AAATTACCGA GCACGTCGCC GGCAGAAACA 1620 

TAATCTCCTG ACTTTGGCAA AATACGCTCA AGTCCTCCAT ACACGTACAC GTGCTTTCCC 1680 

GCCGACTCTA CAAAGAGCAC CTGACCATAA CCCCGATGGG TCCCCCTGGA GATTACCTTA 1740 

CCAGACATGA GTGCACGCAC GGCAGCATTT CTTTCCGAAT CAATAACAAC CCCGTAGgTC 1800 

TTGCCCCGCA CGTACGCAAG CGAAGTCGGG CTCACCGGCC AGCGCGCATT CTTATCAACT 1860 

TTCTTACTGA TATACTGACG CGGATCACGC AGAGAAATTT CTCTCCTCTC GGGGGCCGGC 1920 

ACAGCAGCGG GGGGAGAAAC AGGAGTCTGT ACGGGATCGG CACGGGCACG CGCCCACTGA 1980 

TCCCCATCGG GCAAAGAAGA CACAGATAAG ATGCGGTCCG CGCCCGCCAC GGTAGGAGGC 2040 

GACTTTTCAC GCGGAGGAAT CACGAGCACA TCACCTGGGT GAATAGTGTG CGCAGCAGAG 2100 

ATTCCGTTGG CTGCAAGAAG CGCGGCAAGT GAACAGTTTA GCATACGGGC AATCGAAAAG 2160 

AGGGTGTCAC CCCGGCGCAC CGTGTATCCT CGTGGAACTA CTATGCGCTG CCCAGGCACA 2220 

AGCTGGTGTA CGTTTGCCAG ATTATTTGCC TGCGCAAcGC AGAAAGGGGC ACGCCATAGC 2280 

GACGCGAAAG TGAAAAGAGG GTCTCGCCCT TAGCAATCAC GTGCACGTCT GCGCCGCGCC 2340 

GTAGGCTGCT GCCAACAAAA AACACCCCGC AAGCGACAGA AAAAAACCGC CCATACCCGC 2400 

CCCCTTCTCG GCAGATAAAA AAAAAGCATG AGCGTCACCT ATGCGCGCCC GTTCCCCCTG 2460 

TCGTAAATAA CAAACGTTCA CCCGCAGCCG AGACACTCCA CAGCCGGCAG GAGCACGCAC 2520 

TAACCCTACT TGTCG 2535 



(2) INFORMATION FOR SEQ ID NO: 17 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1226 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 176: 

TGGTTCTTGC GCCGATTTGC TGACGTACTT GAGTCCAAAG ATGCGCAAGG AACTGGAAAA 60 

TGGAACTGAT TAAGGATTAA TGCTCATGGC AAGGAAGAGG CGTGCTCCGC GCGGGGCGGC 120 

ACAAGGGTGG CTAACGACAT ACTCAGATAT GGTCACGCTT ATGCTCTGTT TTTTTGTCAT 180 

GCTGTTTAAT CCAACTGAAG TTGATATCAC GGTGTTGCAG AGTATTGCTG CATCGATTGT 240 
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AGGTGATCCT ACCGGTGGAG GGGTTTCTGC CTCCTCAGGG AGGCTCGCTG ACTTGGGAAA 
CACCGTTAAC ACGCTGCCtT CACTGGAAAA GGGACAGAAG CTGGCGACTG CGCTGAAGAA 
GGCGGTTTCG CTTTTCGCTC CTGAGATTAA AAGCAATAAG ATTGCGGTGA CCAGTGATGA 
GCGCGGTTTA GTTATTTCGC TCACTTCGGA TTCGTTTTTT TATCCGGGGT CTTCCGATCT 
GAATGTGGAG GAATCTCGGG AGGCGTTGTT GCGTGTTGCG CAGTTTTTGT CTGATCATGC 
GCTCGCCGGT CGACGGTTTC GCATTGAGGG ACACACCGAC TCAGTTGAGG TGCCCGAAGA 
TGGGAGTACA GACAATTGGG AACTTTCTAC CCGTCGGGCG GTGCGCGTGT TGCATTATCT 
TACTGATTTT GGTGC aCAGG AAAATCGCTT TTCCCTTGC a GGGTACGCAG ACACACGCGC 
AAAATTTTCA AACGAAAGCc TGAAGGGAGG GCGTACAACC GGCGGGTTGA TATTGTCATC 
CTGGACGAGG GTCACTTTTG ATGGTACACT TCCGCTTCCC CTTTCCCGGA GGTGAGTTTC 
CGGTATCTGC ACAGCCTGAG TTTGAGCTGG TGGCgCGTCT GCACGCCGCT GTAGTACGTG 
AGGAGGATGT ATGGCAGAAA AGGACTCCAT AGGAGATATC GCTGATGATT TTGAGGAACA 
GCTTGTCGCT CCTGCTGCGG ACAGGGTGGG CTTTCTGCCA GGATTGC TC A GATGGGTTGC 
CATTGCAGTA GGGGCGGTCA TCTTCATTGT GACGGTGGTG ACAGCCACCG CGCTGGTGCT 
CGCAAAGCAG GGGAGTAGCC ACACGGCGTA TCCGGTTcAC AGGAGTTTCG GgAGTCTCGC 
GAGCTTTTGC AATAcTACGA GTmCATGGGC CTATCCGTAC CAATACTGCA GATGCGCTAC 
CGGGGACGTT GTAGTGAGCG TTGCGT 
(2) INFORMATION FOR SEQ ID NO: 177: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1079 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 177: 
ATCAAATATT CCGGGGATAT CGTCACATTT CCTGAAAATT CTACCTTTGC AAAAGAAGTT 
GCCGGAACAA CAAAAAGATA AAATACAAAC TCCACACACA ACACGAGCAG CAATATAACA 
ATTACCACCT TCAGTACTGC AACTCTCACC CGCATAGACA TATACCCTAC CAGAGGGTAC 
TTGCACGCTT CAGTATTTCT ACACAAAAAC TGTTTCTTCA CGCTCCCtCC t T ATCTTCTC 
ATCCCCTGAA ACATTGATGA TCAATCCACA TAAAGAAAGC GTTACCACAA TAGATGAACC 
TCCTGAAGAA AAAAAGGGAA GAGGAATACC AGTGGCTGGA ACTAGACGTA CCACCACTGC 
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AACATTCAAA ATTGACTGTA AAACAATTGC CGCCGACGCG CCGAAAGCCA AAAAAGTATT 
AAAGCGATTA GCACATCTGA GTGCGATAGA AATGCCAGTG AGGGTAAACG CAAACAACAA 
CATTAAATAT AAACACACAC CAATGAATCC CATC TCTTCT CCAATAACGA CAAAAATAAA 
ATCCGAATAT ACTTCGGGCA CGCTCGCAAT TTTCCTCACC CCATTTCCAA TACCACGCCC 
CCACAACCCT CCATCCATCA GTGCCTCGAG CGCCGCGTTT ACTTGGTATC CTGCGCCAAG 
CGGGTCTCTA TCTGGATACA AAAATGAAAG CACTCGACGC AAACGATTGG TGGACGTGAC 
GATCATAAGC ACTGCTATCG GCGCAAGAAC CATTATGCCT CTTAAAAACC ACCACAGAGG 
CGCACCTGcA ATGAAGAACA TAACCACTGT GATAAAAAGC AAAAACATAG CGGTAGAAAA 
ATCGTTTTGA AAAAAAACTA CTGACACAAA AATCACGCTC ACAACAAAGG GAGGAAAAAT 
TGATCGTATA GGTGTATCGA AATGCTCCCG GTGCTTATCA AAAAAGTTTG CAAGAAAAAC 
AATGAGTACC AGCTTCACAA ATTCAGATGG CTGAAAATTA ATATCAAACA CCTTAATCCA 
ACGCGTCGCT CCATTGCGCG TTGAACCAAT ACCAGGGAAA AACGTGCACA CACAGAGCG 
(2) INFORMATION FOR SEQ ID NO: 17 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 556 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17 8: 
GnTTTGAAGC CGTAGGCATC TGTGCGCTTG CAGAATGTAT ACATGCAGGG AG TGCGCGT A 
CGTGCCACGT GAGTnCTTGG GGGAAGCAGC TTATGTTTAG ATCGCGGGGG GGGGGGGGGG 
nGnACTTGTA GTGCTTGAAC GGGATGTGGA CCGAGTACTG GAGTACCTGG GAAAGACGGC 
GCTTGTCCAT TTGCGTCTTT CCGCCGCGGC GCGTGGCAGT TCTTCCCACT GTGCGCAGAG 
CAAAGAGTAT GTCGGCCGTC TTGAAGAAGC GTGTAAGTAC CTGGGTGTCT CTGGCGAGTG 
CGCGTTTTCT CCAGGGGATT CTTTGC CTAC CGAAGAAGAC TACACGTTGG CACAGCAGAT 
ACTTGCAGAA GTTGACGCTT TGCACGCACG CGAACGAGAG GGTGATGCTC CCTCAGTTCC 
CCGTGGGAAG AGTTCTGTAG CCCATGATTC TGCCAACGAA GAGCAGTTTC AGGGTGAGAA 
ATGTGCGCTC GGCTCGATGC GAnGCCCGGC ACTGTGTGCG CTGCTTAGGC GTTTTGCGCT 
GCAGGAAnGT GTGCAC 

(2) INFORMATION FOR SEQ ID NO: 179: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 450 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 179: 
TnTGACTAAA GTACATGCTT GCAGCGATAC TCAATGAAAT TGGATTCAAC ACCGGGCTAT 
ACGCTTCTCC TCACGTTATG GATCCGAGAG AGAGAATAAC ACGCGCGGGT GTGTTTTTTT 
CACCTGCCGA GTATGCAAGC GCGTGCACAC ACGTATACCA CACGGTGAAA AAAACAGAGA 
ATCTGCGCGA CTACGGCCAG GCGACGTGGT TTGAGCTTAT AACGCTACTG GCGTTCATGC 
TATTTGCACA ACAACGCATG GAATGGTCCG TTTTTGAAGT AGGACTTGGA GGAAGACTAG 
ATGCAACAAA CATCATTTGT CCTAGTATCT GTCTCCTTCT CCCCATAGAA CAAGAACACA 
CGCGCATATT AGGAACACGT ATAAAAAGTA TTGCAAAAGA AAAGGGCGGC ATTATCAAAC 
CCTATACGCC TATTTTTGTT TTGATCAGCC 
(2) INFORMATION FOR SEQ ID NO: 180: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 605 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 180: 
TGGGATCTTG CAACTGACGC TCCCCCTGAG CGTGTGCGTA TGCTTTTTAn ACGCACGGTT 
CCTACTGGnA TTGAACACGG CACTGTTACA ATTCTTTTCA nAGGGCATTC TATTGAGTGC 
ACTACGTTCC GTGTTGAGTC GGATTATTCC GATAGAGACA TCCGGATTCT GTTTGCTTTG 
CCGCGCGCAT TGAGGACGAT TTGGCAAAGC GCGACTTCAC TGTCAATGCT TTCGCTGCCG 
CGCTCCCCTC GGGGGAAATC ATCGACGTAT GTGGCGGTTA CGCGATTTGC GTAACGGTCT 
TATCTGTAGC GTTGGGGATG CACATGCTAG ATTTTCTGAA GATGCGTGCG TCCTTTGCGT 
GGCTGTGCGT TTTGCAGCGC ATTATCTTTT TCCATCGAAG CGCGCACGnT GAAGCAATTA 
TCGCGCTACG GCTTCATACT GCACGTAATT CTCGTGAnCG GGTGCGTGAT GAACTTTCTA 
AGATGCTTTG TACTCCCCGT CCGATATTGC GCTCCGCTAA TGGAAGAGAn TGGGATGCTG 
CAAACACTTT TTCCTGCGCT GGCGCAGTGT GTGGGAGAAA CGAAnGGGAA GGAGAAGGAG 
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(2) INFORMATION FOR SEQ ID NO: 181: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1265 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 181: 
ATGnCGTTTA AACGAAAAGG AGAGTTCGGT TTTTGCACGT ATTGAATAAA TGTTTTAAAC 
CAAGTGGATG CATTGTCGTG TGTAAAAAAT AATTTTTGGT AGCAGTCGAG CGGAAGCATG 
GAAAATGAGC ACAGTACCGG GGGTGTCGCC GTACCGTGCG ACAAGTTGCG CATAGAATGT 
TTCAGCGGGG GTGTCAATCG TCAGTTCGCG GTACGTAATG CAACAAGGCA TTTGCTGCTG 
TCCTCAAGTA CTGCAGGTTT AGTAGCGGGC ATATCAGAGG CTTCGAAGAG ACGGTAATCT 
AGGAGACAGT ATTCCATTCC GGCCTTTACG AGTGAGGGAA TGAGCGCGGC GCTCCATGCA 
GATGCTTCCA GAAAGCAGCC CCGTGGGCGT TTTATAACGT GTTCTCGCAA GGTAGAAGTG 
AGTAGCTCAA TTTGACTAAT TACATCCGCC TGAGGGAGGA GGGGAAGATA CGGCTGGTAA 
AAACCGCCGC TGAGTAATTC AAGCCGTTTG GAACCAAGCA GATTGTTGAT AAGATAAGGA 
TAGGGAGAAT TCTTTTTTTG TATAAATTGA AAAAAAGAAC CGGTCATGTG CACCGTCAGT 
GGGAGTTGTT CGTGCCTGCA CAGACCTGAA AAAAATGCCT GGTACTGAGA AGGATCGTGC 
TCTGCGTGCA GGGACTGAAC ACATGCTTCC GTTAACTTCG CAGCAAATGC GATACGTATT 
TTGGTATTGT CTCCTTTTTT CATAATCGGG TTCTCATCCC CGCCGCGGGT ATCGGAAAAA 
GCCACAGCTC GTCAGTACCG TACAGGCTCA GTGCAATAAA mCCCATACTT GCTAAGAGAA 
GTGCAGCTGT TTTCTCCACT GcAGTGCCGT TTCCCTGCGT GATGCGCTGA TGTATCGCGC 
GTAAGATGGT GGAGAAACAT AAAAAGCCGA CACAGCTCAC TCCCCACATA ACGAGCGCCT 
CAAAAAGGTT AAGGGCAGTG TACGTAATGT AAAATACGAG TGCGTATAAA AGACCCCACT 
CGCTCTGAGC ACGGTGTGAT GTACACGTGC GCGACAGTTT TCCGCTGCAT GTACTCAGAC 
AAAAGGCACT TGCGCCCACG CCCAACGGTG TAAGAAAACG CAACGCAAGC GGGTCATATA 
CGTAATGTAT ACAGACCCAG TTACCGGTTA TCAATATTCC GCnTAGGAAA AGGAAGCGAC 
ATGTATGGAA GAGCGTGCAA CGTGGAGAGA GCGCAGCGAT GCACAGGCGA TCTAATACCG 
ATGCC 
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(2) INFORMATION FOR SEQ ID NO: 182: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1299 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 




>8/13041 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 182: \ 

GCACCCGGAT GACCGTTCnT GATCGAGCGG GTACGTCCAC CGTGCGCCCC ACACCGAATG \ 60 

TnGAGCGTCC GCGCTCCCCC CTGCATGCGG TACACAAAAA ATCCGCTCnA CGCCGTCAGT 120 

GCCAGCACCA GCGCCACGCA GCACCCGTCC AGCAGCGTCA GCATTCCGCC GCGCGCGGCA 180 

GCCGACCCGA TTCGGGGGAG AGGTGGAGGC ACGATCATTC TCGCCGCAGG CCACGCGTCC 240 

CCCCGTTAGC GCTCAAAATA CGCACGCGCC TGCCGCGcCA GACGCCGCCG CCCGAGCATC 300 

CCCCGCACCC CGCGGGCAAA AAATACGTCC AACACTGCAC GGGCGTGCTG CACCGCCGCA 3 60 

TACCCACGCG CCACCAGCTC tGCGGaCGAG AAAAATCACT CTGGTCccGC CGCGTTTCTA 420 

CCCGCACACA CGCGCTCGGG TAATTGCGCG CCTGCGTGGC AACCGCATGC GACGCACACG 480 

AAAGACTGCG CACAACCACG TCAAACCCCG TCCGAAAGTC ACTGAGCCCC AGCTGCTCAA 540 

ACCGGTCAAA CGTCACCGCC AGCACCGCGT cAAAACCCTG CGCGCGCGCA ATCCACACCG 600 

GCGTGTTGTT CAATATACAG CCGTCCGCCA AATACACCCC CTCCTGCCGC ACCGGCGCAA 660 

AAACACCCGG GTACGCACAC GAGGCGCGCA ATGCACGGGC AAGCACCCCA CTGGAGAGCA 720 

CAACCTCCGC GCCCGTACAC AGATTGACCG CATTGCACAA AAACGGAATT TTACAATCGT 7 80 

GAAAGGATTT CCCCCCCGTC ACCCGCGTCA gsAGcGTGGC AAACTTTTCT CCCGAATCAA 840 

GCCCCAGCCC CCGCACGAGC GTGTTCAAAC TCACCCCCAG CTGCACCAGC TTGCCCAAGC 900 

GTTGAAACGC AGCCCGCGCC CCCCACACCA TCCCCGCCTC AACGCACGCA GAGGGATCCC 960 

GTGCATTCAC ATAGTCTGAA ATAACAAAAT CACGCTGAAA AAACGCCTCC ATCTCCCGCA 1020 

CCGACATCCC CAGCGCATAG AGCGCCCCCA CCACCGCACC CATAGAACAT CCTACGACAC 1080 

ATTGCGGCGG CGGAACCTGT AGCGCTTCAA GCGCCTTGAG CACCCCAATG TGGGsAATTC 1140 

CCCGCGCACC ACCACCTGAA AGAACGAGCG AsCACTTCAC GTGCGGTCAT TATGAGCGTT 1200 

TTCCTCCCTG CTGTCCATTC TCCCCCCAGT GTGATACCGT TCCAGTACGC AgTATGGAAT 1260 

CGTTTGTACG CAGC GCACTT GCGGCGCGCA CACTCCCCA 1299 
(2) INFORMATION FOR SEQ ID NO: 183: 
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(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 1115 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 183: 

TTAACCGGGC GTTTCCTTGC AGGTTACGCA GGATTGCTTG CAGCAAGCGG GGCAGAGGGT 60 

GCCTTTGATT GGC AGAGAGC GGACGC G ATT GCAGCGTTAG ATCAGGCAGG AGCGCAACAT 120 

GCGGTTTCTC CTATCGCTCA TCCTGCAGGg TGGATGGAAC TTAATCCGGC TGTGCTGTCT 180 

GATGTGTATA GTGTGGCTGC AGGATATCCG AGTGCGGAGG GACTCCCTCA TGCGGGGGAT 240 

AACCGGGCGG CGCTTGCAAT CGC ATCGTTG AGAAATAGCC CGGTTATGAT TGGCAATCGA 300 

CACACGTTTG ACGAGTTTTT TGcAGAAGTG ACAACAGCAA TCGGTCTTAA GGGAGAACAr 360 

GCGGAgCGTT CGATGCAAAT GCACGCTGCA ATTCTCAAGG AGCTCACAGA TATGCGTGAT 420 

GCGACGTCGG GAGTGAATAT TGATGAAGAG TTGGCGGACA TTATTAAGTT TCAGCACGGC 480 

TATAATGCGT CTGCGCGTTT TATTGCGGCG GTGAATGAAA TGCTCGACAC CGTCATCAAT 540 

CGTATGGGTG TTTAATTTTC AGATAGTGCA CGGTATGAGG AAGGGAGGAT GGGCGTGAAG 600 

CGTATCAGCT C AC AC ATGC A GGGCACAGAC AGTGCCTTTT TCTTAAGGGA GCAGGAAAGT 660 

AGAC T ACGGA AGGTAAACAA TCAGCTTGCA ACGCAGCG T A GGATCCAGCA GtTCGCGATG 720 

ATCCGCTCGC TGCAGGTCAT TCTGTGAGGT ACAAGTCGTC CCTGGCGCGT TTAGATCGCT 780 

TTGAGAGAAA CACGAAAACT TTACGTGACC AGTATCAAAT CGCCGAGGGG TTT ATG AC TT 840 

CTGCGCTGAA CGTA t ACAGC GTCTTCGGGA AATGgCTGTC GCAGGAGCGA ACGGAACCTA 900 

TACTCCTGAC GATTTAAAAA AAATGGCGAG TGAAGCAGAT GAGCTTTTAC AGGAGCTGGT 960 

GCACAATGCA AATGCAGTGA GCGCAGATGG GGTGCGGGTA TTTAGCGGTA CCAAAGTTTT 1020 

CACAGAGCCC TTTGAAACGG TCATGGGGAA TGTTGAGGGA TTAGGGTCTG AAGTGATCAC 1080 

TCAGGTACGC nTnTTTCCCA AACCGGGGGG TTTTT 1115 



(2) INFORMATION FOR SEQ ID NO: 184: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 464 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 184: 

CAGCCTTGGG CGACTTCCTC TGTGAATGTT CAGGCCTATA CTAGGCAAAG ATTTGAGAGG- 

TTATCTCGGC AGACCTCCGC AGCTCTCCCT CCACCGTAAC AATCTCCTCT CAGGACCATG 

ACCCATCTTT TACCCTCTAA GCTCAGATCT CTCACACTTC ACAGATTCCC AGGCCCTGGC 

TTGGTTCCTC CTCCCTGCAC GACAGCCTGG AAATGGGCTC TTATAAATCA GTGCAGTTGT 

GGGCATCACC TTGGCCTTCC CCCATCTTAG GGATC AT AGT GCTAAATTGC ATGTTGTCCA 

AAATCTGAAA GCCATgTTTT ATGTATTTCC CCACTTTTCT CATGTTTAAG GTGATGCAGT 

TAATCTaCCT CTGTTAcTCC GCCTgACTGA AATTGGAAGT CCCgTCTgTG CTTCTCTCCC 

AGATTcATAC CAkGCTCAGT TAnCTGGTCA ATTTTGTCTG TTGA 

(2) INFORMATION FOR SEQ ID NO: 185: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 580 base pairs 
<B) TYPE: nucleic acid 
(C) STRANDEDNESS : double 
<D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 185: 
CGCAGGTTTT GCCGTGTAAT CTAATGCTTG AgCtCTGCGG ctAACTTCTG TCCaGAaTCA 
GCAaTTTCCC CAAAAAACAC GCTACaTGCm TcAAaGGCAT AATCACGTAG AGCAAACAGC 
GCATATACGA TAAAAAAACC AAATACAAAC TGCATATCAA GCGTAGAAAA ACGGTGCACC 
CAAAATACCG ATATGTAACC TGCTGTACAT GAAAACCCCA CCGTGCATAA CACAAAACGC 
GATAGCGCAA ACACCACTTC TTTAAAGGAA ACAATTC C AG TAGACGCAAA TGCCCACTGA 
GAAAATAAAA CTATAAATAC ACCTCCTAAG GGAAAAAACA CAAACACCCA GGGAAAAACC 
GcAAACAAGA ACAAGATAAT aACCCACGTG aTaAAACCCA AAGCCACTGc CCCCGcGAGa 
AACAGGAGTT GTTGCCGATA TGcAATACTT TTCATCATGA ATGCTTTTTT cACCAAAGTT 
GCGAACGTAA aCAGGGAATG ATTAGAAAAA AAAGCGCCGC GTTATAACAC AnTACCCGAA 
ACAGAAGAAA ACAAATAATT TTGGTAAAAT TTAAAACTTC 
(2) INFORMATION FOR SEQ ID NO: 186: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1377 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 186: 
AAACCGCGCT GCGCAGCCTG GGTTAAACGT TGCGCTTGAA ATAGAAAATC AGCAGGTTGT 
CGGGsCGAGT ATGGGGGAAG AGAGTATTCG GCAGGGTACG CGCGCGCTGG TGTGGGGGCT 
GTGCGCGGTG CTC TTATTTA TGCTGGTGTG GTATCAAGAA GCGGGCGTAA ACGCTTGTGT 
cGCGCAgCTT TTGAATCTGT ATATCATGTT CGGTGTGCTT TCAGCGTTTA ATTTGaCcTT 
ACGCTTTCGA GTATCGCGGG GATGATTCTC ACTATTGGTA TGGCAGTGGA TGCGAATGTC 
GTTGTCTTTG AGCGTATACG GGAAGAACTT GCGCTGGGCA AAAGcCGCGG GGCTGCTGTG 
TGTArCGGCT TtGAgCGTGC GTTTTGGGCA ATTATGGATT CAAACGTGAC GACGTTTATA 
GCAGCGCTTT tCCTTTCGgT GCTCGgTACC GGTCCTATtA AGGGTT t CGC ATACAGTTTG 
GCTATCGGGG TGGTGTCCTC CGTATTTACG GCATTGTTCG TTTCCCGTCT GATGTTTGAC 
TACGGGACGG AGGTTTTACA CAAAAAGACC GTGCGCATTG GATGGAGGAT TGCTCGCGTA 
TGAGACAGGT GGTGCGTTTC AGTTTGCTGT TCCTGCCATG CGCGATACTC AGTGTAGTTC 
TCATTGGTGC GGGAGTGCTC CGTTGGGCAT TGTGGGGGAT GAGCTTTGGT ATCGACTTTC 
AGTCTGGTTT GATTGAACGG CTGAGGATAG CACCTCCTGC TTTCTCTCTC GTGTACACCG 
GAACGCATCG ATGCAGTTTT TTCAGGATGA ACAGAAGGTT GTGTTTACTG TCTCTTCGCC 
TGGGGTGCTC GGTGAgCGTT ATGAATTTTT GTATACGGAG TATCCAACCC TTCGTGCCTT 
CTCCGAgGGA GCAAAGAAGG TGGAGCACCT CAGTGTTACG CTCCATGcCC CTGAgACTGT 
GtAcATGCGT GAwACATTCT CCGGGGCGGA GGGCTCCACG TTGTCGAGTG CTTCGTGTTT 
TGTGCATTAC TTCTCGGAGG ACGTTCGTGC GCCAGGGGTG GAGGAGTTGC GCCGTGTGCT 
GAAGGATGTA CCGTCTGCGG TGGTACAGCA GGTAGGGGTG CGCGCTGAGC ATACCTTTCA 
AGTTCGCGTT GCAGCTGAAA CTGCCTTCCC GTCCTCCCTT TkGCCAGAGC AGGGAGGAAC 
TGCTCTGGCm CAGTCCGATG CTCCCGATCT TGTTACCCCT CAAGGTGCGG TGGAAAGCGT 
GGTGTAaCGC GGCGCTCGTG CGCGCGTATG GAGCAGATCA t GTGGTCCGT TTAGCGATGG 
ATTTtGTCGG ATCTCGTTTT TCTCATCTGT TGGTGCGTTC AGGCGTTGTT GTTGGTT 
(2) INFORMATION FOR SEQ ID NO: 187: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 483 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 187: 

CTGCTGACCA TGGCGGCGGG GTGTCTGCTG GTGTGTGCTA AGTGTTCGAG AGTCCCTTGT 60 

ATTGTTAGAT TCCACAACTG CACATTGCGA TGTGCCTGTT CGATACAGTG AGCAATGTGT 120 

GCGAAnATGT CTGTATTCCG TGCGGAGTAC TCGGTGGTGC GCTGCAGGGT GCGCATGTGC 180 

CTGACGAGTA AGGAGAAAAA AAGGGACACG GCGTGTTGCT GTTTGTGAGC TTGGAAAAAA 240 

TGCAGCGTAG AATCAAGTGA GGTAAGATCn AnTCCTGATA CTTGAGGTTG AGTACCTACC 300 

GAAAC CCAAG TATCAGGAAT GATGCGTCCG ATACACCTAG CGTGCTGGCA CACATACAGC 360 

CAAAAGTACA TCACGGACGT TTGCACCTCT TCTAAAGAAA ACGACAAAAA CTGGATATAA 420 

ATAGTGGCAA GAG T AC AC C C TGTGTGTTTT TTGCATCGGT CGGTCTGTCA TGAAATACGC 480 

GTG 483 
(2) INFORMATION FOR SEQ ID NO: 188: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 846 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 188: 

TTTTCAGCAT TACATTGGGC GCTTTCGAGA. ACTGTGGCAC CAAGGAGGTA ACGGACGCCT 60 

ACTCTATCTA TGATACCACC GGCCAAGTGA TGTGGGATAA AAGCTATCTA TTTGGGATGC 120 

AGGTGCCATT TCGTTATTAT GCAGAATGTG CACTTCGTTT TCATAAGACA CCTTTGTATG 180 

TAGATGTGCA TGTGCCAATC ATTTCTGATC CCTTTTTTTC TACTGATTTT TTACAACGGA 240 

tGAaGATCTC AACTGGTTTC ATCTTGACTT TAAGTCCTCG ATCGCTCAGA GAGCTTACGC 300 

AAAGCACCAT AAGCAGCTAT GACTGGAAAA TCTCTGCTTC GCTGCGCCCT GTCTGGCTTG 360 

TACTTCATCC TTGGTTGCGC GATTTTTCGC TCGATCCGAT TTCGTTCACC GTACATTTTA 420 

ATTCGAAGTC TGATAGTAAA AAAAACAACT CTTCTCCAGA GCGTAACTTC TTCTACCCTC 480 

ATTCGATGGA ATCTCGAGCA GGATTGTCGT TCTCTGGTAC GCTGTTCTCT CATGTGTGGG 540 

AAAGACAAAA ATCTCAACAA AAAGAATCGT ACGCGCCAAA AGArATACGT AATCCACTTG 600 

CATACACTCC TGCAGACGGG TTATCTAGGG AGGGwTCTCC TCCTGAACAG TCCCCTGnCA 660 

GTATCAAAGG AGAACAGCGA GACTGATTCT ACCTTCGATT TTTTTATGCC AGAATTTCGT 72 0 
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GAAGAAAATG AACGTCGTAC TGGTACTGAT CATGCGTATG TTTTTACGCG ATACGCTTTA 780 

GATTACAAAG GTAAAGGTGA CATCGTGTAC GATGCACAGT TCAATCACGG TTCGTGGGGA 840 

TGACGC 846 
(2) INFORMATION FOR SEQ ID NO: 189: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 689 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 189: 

CTGnCACAGG nACAAACGGC GACCCCGGCA GTTTAACGGT GAACCCTAGG TTAGTAGCGG 60 

AAAGACCGAC GTACATGTTG GGTTCTGCAG AGCTAAAGAA CTTAGCAACG TTCCCCACCC 120 

ATTGCAGACC CACGTCGGTA CCCAGCGACA GGTGCGTTAG gCCCTGCGCG TCGCGGAACC 180 

CCGCCTTCAG GTTAGCCCCT ATGGAAAGTC CCCCAAACCG ACGGGAGAAG TTCACAATCC 240 

CCAAGCCCCC CAGTTTTTTA ATTGGGTTGG AGGCAGGCGT GCACACGGGC CCCGTGGAGG 300 

GAGAAAAGTT AAACCCGGAT TCGGGGAAAA ACATACGCAT CGACGCGCCG TATCCCCAGT 360 

TGCCCGACTG GCCAACGTAG GAAAGCGTTT CGGCATGCGA ATTGTTAAAT CCGACGGTGT 420 

GGGCAAAGGT TAGCTCACTG TGCGTCATGT TCGCACTCCC TGCTGGGTTC GCCTCAAAGA 480 

AGCTGGCATC GTTTGCCAAT GCGGTGAACG AACCGTCCAG AACACTCAGA CGCCcTCCGG 540 

AAnGGGAGGA AAcTGCGCGC CTCTTAAACT CACTCATCTT TGAGCGAGTC TTCGCCGCAG 600 

CTTCTGAAGA GGCAATCGCC ACGCTGGCTC CCGCGAGCAC GAAACCACAG ACGAGCGCTG 660 

CGCTCTTATA AAAAACAGAT TTGTAATGT 689 
(2) INFORMATION FOR SEQ ID NO: 190: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 942 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 190: 
TACTCCCCTC TCTGAACATG GGGAGCTGGG TGTTTGCGCA GACGCGGGGA CAGGTGGTGT 60 
GTCCGAAGGA GCGGCGCTGC CGTGCGCACG GAGGGGAACG CGTTCCTAAC TCCCTGCCGA 120 
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AACAGGTTGC CGGGTGCGCG GTGCCTTCTG CAAAGGACTC ATCAAAGGCT ACCTCAAGCG 
GCTACGGCTC GGGGTCCGTG TTGCGCGTAG ATGCGAAACG CATCGATTAT CTTCTGAATT 
TGGTAAGTGA GACGGTGATT ATCAAGGCCT CGCTCAATCA GAGTGCGCTg AATTTGGGGA 
GGTGTACACC CTATTCCAAA ACGCTAATGG CGCGTACAAG GAGCGTTTGC GTAAGTTTTT 
TGATAGGGTT CCCGCTTACT TAGAAAAGGT AAAGAACGGT CAGGACGCAG ATGCGGTGCG 
CAAGGGGATG ATAGCAGAGG CTGTCGGTGT CTTtGACATT TTTTCTTCGT TTGAGAATGG 
ACTGAAACAG TCCGTCACTA AGTTTCGGTC TTCTGCTCAG AATTTGGGGC GTATTTCTGG 
TGAGCTTCAA GAAGGTGTGA TGAAAATCCG CATGGTGCCT ATTAGCCAGA TTTTCAGTCG 
TTATCCGCGT GTGGTGCGCG ATCTCTCGCG GGACTTGCGT AAAGAGGTGC GGTTGGTCAT 
TGAAGGAGAG GAGACGGAGC TTGATAAGTC TGTGGTTGAA GATTTGCTCG ATCCCATTAT 
GCACTGCGTc GTAATTCTCT CGACCACGGC AT AGAAGC GC CTGAAGTTCG CGCGCGCTCT 
GGAAAACCGG CGCAAGGTAC GCTTcTCCTG CGCGCAACAA CGAAGGAAAT ATGATCGTaT 
TGAGGTTGCC GATGACGGGC GTGGcATCGA CGTGGAgGCA tGAAGACGAA AagCAGTTGA 
GCGArGTGTG TTGCAcCCAG GcAAGAACCT C AC TGAGGTT GA 
(2) INFORMATION FOR SEQ ID NO: 191: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 413 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 191: 
CAAGTACAGC TACGCCGCGC ACAAGCTCAA ACAAGGACAG AGCCACTCAA TCCACAGTGC 
ACACACAGCC GGACGCAACC GGCAACCTAT CTCACGCACG AACCTAACCG GGGTAACTCT 
GCTGGCAGAG GTCAACAGGA CGGGGATTCT ACCTCCGATT TGCACAACGT TCAAGCACTC 
ACGTACGCAA GATATACGCA CCGCTTTTTC CTCAAGTGCA CCCGGCGTAT TCCGATACCC 
CTTACCAAGC ACGGGGTGCA ACGGCAGGGA TGCCAnGGCG GCTGGTCCGC TTGAAAGCAC 
CAAAAGGAGT ATGCTTCATG ATTCATCATC ACAACATGAG CGTATGTTCT CTCCAAAGAA 
CGCTCGGACA CAnAAATTGT CCGTCCAGAA GAACnTGGAA GTTGTCTTCA GGA 
(2) INFORMATION FOR SEQ ID NO: 192: 

(i) SEQUENCE CHARACTERISTICS: 



180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
942 



60 
120 
180 
240 
300 
360 
413 



(A) LENGTH: 503 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY : linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 192: 

TACATACCTC GCTCTGCTAA TCCTGTTACC AGCCCCCCTG GGCGCAGGAG TTGTCCAAGA 60 

AGTATTGCTC TAAAACGGTC AATCTGTACT TCGTACATGG CAATATCCGT GACTTCCTCC 120 

CCCATCGCGA TATTCAGGGC AGCTTCTCCT TCGTTCGCAT TAATGACTAC ATATCTGAGG 180 

TTATCTTCGG TAACCGGGAT ATCATCGTCT TCTATGACAA ATCTGCAGGG CTCACATTTT 240 

GTCTACAAGA AATGCTGAGC GCTTACTTAG AGCGTATGCA TGCCCAGTAT CCTACTGAGG 300 

CACTTGCTGA CTTTCTTTCG CGTGATCCGG TGAAAGCTTT TGCGTACCTT GAGCGCTACT 360 

TTATTATGAA CATGAAACAG AATAAGCGTA GGTCCTCATC ATCGACTATC TGAATCTCTC 420 

GTTCC CTC AG AAGATATGCA AACTAAGCGA AACAGATCGC TATGCTCGTC ACCCTCAATC 480 



GCGGGCAAAT GATCCGGTGT TCA 
(2) INFORMATION FOR SEQ ID NO: 193: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1038 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 193: 



CAATTGCGTC 


GGTTCGCCGT 


TCTTCTCAGT 


GCCTTGTTCG 


TGCTGCTTCC 


GTTTGCGCTG 


60 


CCGCGCTGCT 


TGCGCACAAG AACCTAGCGC 


GCGCGGCGGG 


TTGCAATCCT 


TTTTTTGCGT 


120 


CTGGGCCTCA 


TGCTGTGTGG 


TACGCTCTTA 


GACGGGCGTT 


CGTGGCGTAA 


CGAGCTTCCC 


180 


TTTCACCTGT 


GCCCCGCAGC 


GCTCATTnGA 


GGGTCGCTGT 


ATTTCATCAC 


CCGCCGACCT 


240 


ATCTnTTTCA 


ATCTGCTGTA 


CTTTTGGCAT 


TTTGGCTCTT 


TCGTTGCGGT 


ACTCTATCCG 


300 


GATCTCACTC 


GGGCGCACAC 


CATCTTGTAC 


GCGTACTTGT 


TCATGCTCAC 


CCATTGCCTT 


360 


GAGCCTGCGA 


TGGTCGTGTT 


CAGCCTGCTC 


CACTTGCGCG 


AsGCATTAGC 


AAGCGTGGCC 


420 


TGCAATGCGC 


ATGCTTGGCT 


TTCTTCTGCT 


TGCAGCAAAC 


GCACTCTTTT 


GGAATCGGAG 


480 


ACTCGGCGCC 


AATTACCTTT 


TCATTAGCAA 


ATACCCGCTT 


GAGATCCTTC 


GGGTAATCCG 


540 


TCCTTTTTTT 


GTGTATCAGC 


TGCTGTTTGT 


CAGTGCACTG 


TGCCTGTTAA -TGCTGGTACT 


600 
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CTACCTACCC TTCCGGCCAA GCCAACACGG AAGAAAC C AG CTCTTCGTCA TTTAGCTGCT 660 

CGCTGTGGTT CCATCGGACC CTTCTGCCAA GGCAGAgGGC ACGCGTGCCg TTGGGaCATC 720 
GTGGTGGAgT TGCAGCGCGC TtGcGTCAmA mGGCCTtAmT CCGCGGTCAA cGCGAACGTA 780 
TAATGGTGCG CtGTGACTTC TGTAmCAAAA mCGCTTATCA TCcAAGCAGA CCGCTCCATT 840 
TTACTTGATG TGCACGCTCC TGAGGCGGTA GAGCACGCAA GGCGCTCGTT TCCTTTGCAG 900 
AACTGGAAAA ATCTCCAGAG CATCTACACA GCTACCGACT CACTCCTCTT TCTCTGTGGA 960 

ACGCCGCGAG CGCAGGGATT CAGCCCCCAG AATGGATTGC ACAAACACTG ACGCGTTTCT 1020 

CACGGTTCAn nTCCCCCG 1038 



(2) INFORMATION FOR SEQ ID NO: 194: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 441 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 194: 

TTGTCGATAC GnGATATATC nTATCTGCTG GGCAAAATGA CCAAAATGTG GCAGTCCAGT 60 

TGCAAAAGGG GGACCGTCAA AGAAAACATA AGACTTCCCC TGCGCACGCT GCGCCACAGA 120 

CTGCTCAAAC ACCCGGCGTT CCCGCCAAAA GGCGAGAATA CGCCGCTCCT GCGCGACAAA 180 

ATCAACCTTT GGGTCCACAG GCGTATACAT ACAACCTCCG TTGCTCAGAA TCGCATAAGG 240 

AGCGTAAGGC ATTATATCAT TTTCGTCCTT CCTTTTCCCC ATACGTCTTA TGACCGGCGC 300 

CACACCTTTC CCCACCTGCA CCAGATACCC CACGTGTGGC GTAATCGCAG TGGCTCTGCC 360 

ATTACTGCAT GAGTATTACT ATGCAATAAT GCCCCACATT ACACCTTCTG CAATCGAATA 420 

CGAAAAAGGC ATCATCAGAA C 441 



(2) INFORMATION FOR SEQ ID NO: 195: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 514 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 195: 
CAAACAGCGT TTTTGCACGC ACATGATGCA ACAAATATTC ACTGACTGCC TGCGCTATGG 60 
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ATAAACCGTC TTCAGTTGCC GTTCCCCGTC CTACTTCGTC CATGATAACA AGGCTGTCGC 120 

GGGTTGCTGC ACGCAGGATG TGTGCTGTTT CACTCATTTC TACCAAGAAG GTAGATTCCC 180 

CGCGCGCAAG GTTATCGGCC GCTCCTACCC GACAAAAAAT ACGATCGACG GGGGTGAGCT 240 

CTGCCTTTTC TGCAGGGACA AAGGAGCCAA CCTGCGCAAT CAGGCAAATG AGCGCATCTG 300 

ACGCAAAAAA G T AC TTTTTC TGCCATATTC GGTCCGGTGA TGAGCGCAAA AnCGGGCAAC 360 

AACGCATGTT CAATTGAAGA AAGTGTCAGA TCATTGGGTA CAAACTCCCG GAGGAGATGA 420 

AATTCCACCA CGGATTCTGC CCCCGTATAC GATGCACGTC TTGATAAGAC GGTGATCAGC 480 

AGTGACGCAn GGCGAAAGTG GGACTCAnTT GCAC 514 



(2) INFORMATION FOR SEQ ID NO: 196: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 407 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION : SEQ ID NO: 196: 

TTTGTGACAA nTGGTACCGG CAACTGGnAA GGCTTTCGAG TAGAATCCAA ATTTACAAAA 60 

CATTCCCATT AAAAGCACAG AAGGAAGAAA AATAAGGCAG GCGTTTCAAG CTACTGTTGG 120 

GCATGAGTTA ATTTCGGCAG ACTATACACA AATAGAGCTG GTCGTGTTGG CCCATCTATC 180 

TCAAGATAGA AATCTTCTCA ATGCATTTCG ACAGCACATT GATATTCATG CATTGACTGC 240 

TGCATATATT TTCAATGTGT CTATAGACGA TGTACAACCT GCAATGAGAA GAATCGCAAA 300 

AACTATTAAC TTTGGAATCG TGTATGGAAT GAGCGCTTTA GATTGAGTGA CGAACTTAAA 360 

ATTCTCAGAA GGAAGCGCAG AGCTTCCATT AC CGTTATTT TGAAACG 407 



(2) INFORMATION FOR SEQ ID NO: 197: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 410 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 197: 
GGGTGGnGTT GGTCGGTCCC CCAGGAnGGT AAGACGTTGC TTGCACGCGC ATGGCCAGGT 60 
GAGGCGTCAT GCCCTTTTTT CGCATCAGTG GCTCAGACTT CATCGAAATG TTTGTGGGGA 120 
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TTGGCGCCTC GCGTGTGCGC GATTTTATTC AAACAAGCGC GGGAGAAGGC GCCAGGGATT 180 

ATTTTTATCG ATGAGCTTGA CGCAATTGGA AAAAGCCGCC TGAACGCTAT CCATTCCAAC 240 

GATGAGCGGG AACAAACGCT TAACCAGCTT CTGGTAGAAA TGGATGGGTT TGATAACACC 300 

ACCGGTCTCA TTTTGCTTGC TGCTACCAAT CGCCCCGATG TGTTAGATCC TGCGCTCCTA 360 

CGCCCCGGTC GTTTTGACCG ACAGTTTGCG TAGATCGGCC CGATCTTAAG 410 



(2) INFORMATION FOR SEQ ID NO: 198: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 429 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 198: 

nGACGCAACA CTGACAACTG ATGGnTTCGC TCATCTCTCT CGGTTCCCTG AGTTCTTTTA 60 

CATACGGTCT CTTTGCCACG GCGCGTATAG CAATGGGCGT GCATGCCAAC GACACGGCAC 120 

TTGTATCGCA CTATGTGGCG GATTTGTATT TCGAATCAGC TGCAATGATC GTAACGCTCG 180 

TCACGGTGGG TAAATACCTG TCCGCCTTGT CTAAAGGGCG CACTTCTCGC GCACTCACAC 240 

AACTGCTAGA CATAAAACCT AAAACGCTCG CGTTATCGTC AGTATCTGTT CCGCGCGGAG 300 

ATCCCTTCTT CCCCAACAAT GCAACGCTGC ATCAGCCCAT GAGACC CATG AATTGAGATA 360 

GAAATCTGCA CAGGACGTAA TTGTTCGGAG AnACGTAATT GTAAAAGCCA GGTGAGTAAG 420 

TTCCGGTAn 429 



(2) INFORMATION FOR SEQ ID NO: 199: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 374 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 199: 

CGCTATCGTA nGGAGGATCG CTGGGCTAAT TCGGTAAAGG CGGTCATTCT CTCCTTCATG 60 

GATGGTGGGT ATCACCTGAC GGGGTTGAAC TGCACTATCC TTTCTCAGAT TCCTCCTGAT 120 

GCGGGGCTGG GTACTCCCAA TGCGCTGAAG TTGCCATGGC CCTTGTGCTT GGAAGTTGTT 180 

TGCCGCTACG CTGCCAAAGG AAAGTGTTGT TTCGATCGTG GAACACGCAA ATGAGCGCTA 240 
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TCTCAAGACC CACGCACATC GCGCGGATAT TCTGTGCGTG TTGTTTGCAA AGCAGGGTAA 300 

CTGCGTGCGC AC TG ATnACC GCAAGAAGCA nGCGGAACTG TGTCAATTCC CTCGGAGGGA 360 

AACGTATGTG CTAC 374 
(2) INFORMATION FOR SEQ ID NO: 200: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 382 base pairs 

(B) TYPE: nucleic acid 

{ C ) STRANDEDNESS : doubl e 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 200: 

TGCAGCAGGC GTCTGGATGT GCAGGAGCTT GCGCAGTTAC GGTGTGATCT GTCTCGAGTA 60 

CAGGAGCAGG TATATGCGCA GAATGAAAAA AACCGAGCCC TCCTCCGTGC GCAGTCAGCG 120 

ATTTGCGCCA CAGnCTCGAT GAATTTCGTG CGGCGCGTGG GCTGGTGAGC TATGAGGACG 180 

GGGAGGACTC AGGGACGGTA ATTGATCTGA GTTTATAAGC GCTTTTGCCA GACCATCTCA 240 

GGGGAGGAGG TGCGGCAAGC TGGATTTTTC CAGTCGGTTG TGTGACAATG CGCTGTGGAA 300 

CCCGCTGTTA TTTTGCGTCC CCTTTTGGAA AAAGGTGAGT TAAAGCAGAT GTGGAGCGTG 360 

CGCAACGCCG CGGnTATGTG CT 382 



(2) INFORMATION FOR SEQ ID NO: 201: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 541 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 201: 

GGGGCCAAAT AATTCCGnTC CCAAATGGCC AAGnGGTTAA GCCAAAAAAA AAAAAGGCCC 60 

CGGATTTTGG GTTCTTAAGG AGGAAGGGAG GCCCCAAAAA GGCCAAAACC CACCCCCCCC 120 

ATTATTCGGA CTTCAAGGAG GACCATTAGG TCGGGCTTTC AGGCCATTAG AGGCCGCAGT 180 

AAGCAGCTGG CAGGCGTACC GAGCCTCCCG TTTCGGTACC GAGTGCAAAC GGTGCCTGGA 240 

CCGCCTGCGA CGGCGGGCAG nCAAACCGCC GGAACTTCCC CCGCTGGTGC GGCTCCGATC 300 

CCGAGGATTA CGCGTCGGCC CATAAACAGA ATACTCGGTG GAAGAGCCCA TAGCAAACTC 360 

ATCCATGTTC GTTCTCCCGA GCGGGATAGC ACCTGCGGGC GCGCAGnCGG CAAACAACGG 420 
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TGGGCATCGT ACGGGAGCCC TAT AG TC TG C AAAGAGTTTA CTGCCACACG TGGCAGTGCT 
TTCCTTTTCA CTGAAATATT GTCCTTGACA GCCAAAGGGT AGACCTAACA AAGGCTTACC 
T 

(2) INFORMATION FOR SEQ ID NO: 202: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 722 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 202: 
CAGTAACAAC GCCAACCGGT CTCATCAGCA ACACGGTCCC AGATGnACCA CCTTCnGAGA 
CGCTTCGGCT ACGGTCACCA ACTAGCACTT TGAAAAACAA GAACAACAAA AGAAAGAAAA 
TGACGT tCAA ACTGACCAAC CGAACGGCAT TTAACCCACG AgCAGCCTCC TAAAGAGAAC 
TCTCAATGAG TACATTGTAA CCTcCACCTc ATGTGTGTTA TGCCTCGTCT ACTCCCGGCA 
GGCCATCTTT TCAGGAAATA CCAAATGTTC CTGAAAGAAG ACTGGCACAT TCCCAGCACG 
ATATTGCTTG CGATAGTCAC GCATAGATAC AACGCCACCG GaTGCAGTGT CACGATAGCA 
ATCAGGTTGA TGACCGTCAT aAATACCATA AAAAGATCCG CAACACCCCA AACAAAATGA 
AAACTCGCAT GCGCACCGAC AAATACCGCA CTGACACAGG TAACTCTGAA AACACTCAAA 
ACCATTTTAT GGTCCTTAAT GAAACGTACG TTTGACTCCG CGTAgTAATA GTTACCCATC 
AGTGAaCTGA ATGCAAACAG AAAGATCGCA AGCGTCACCA AGTGCACCCC CACGGGGCCA 
ACTTGCTTGG ACAGCGCTTG CTGCACGAAC TGCATTCCGC TCACATCCAC TGATCCTGnC 
AACATCAGAG AGCAGCAACA CAAAAGCGTC ACACTACAAA TTAGCATCGT GTCTATAAAC 
AC 

(2) INFORMATION FOR SEQ ID NO: 203: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 648 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 203: 
ATCnCTCCGA GAGTTTGGTC GAAGGAGGCA TGGCTGTGTT CAGCATTCTA GAAAAGAAAC 



480 
540 
541 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
722 
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AATTTTCTCC GCAGTGTTTT ACTTGAAGGT GCATGCACCT GATATTGCAA AAAATCGTGC 120 

TGCAGGACAG TTTGTGCTTG TTCAACTTGa TGACGAATAC GCTGAGCGCA TACCGCTAAC 180 

GATTGCGGAC GCGCATGsGA TgAAGGGTGG ATTGCGCTAG TGATCCAGAC TGTTGGCGCC 240 

ACTACTATGA GGCTGTGCGA AAAGGAAGTG GGCGATTCCA TCTCTGTAGT TCTCGGTCCG 300 

TTGGGAAATC CAACTCTCAT TGAAAATGTA GGAACTGTCG CCTGCGTTGC AGgGGTGTTG 360 

GGGCAGCTCC GCtGkATCCT ATTGCCCAGG CGCATAAAAG GGCTGGAAAT CACGTCATTG 420 

TAATCCTTGG GGCGCGCAAT CGGGATTTAA TTATTTTTGA AGAGGAGATG CGCGCGCTTG 480 

CAGACGAGCT GGTCATTGTC ACAGACGACG GCTCATATGG ACGCAAGGGC TTAGTGACTG 540 

AGCCCCTGCG TGAnTGTGCG AGCGCGCGTC CTGTCCACAG GAGGTGGTTG CTATCGGTCC 600 

GCCGATTATG ATGAAGTTTT GTGCGGAAAC GAnGCGnCCC TTTGGGAT 648 



(2) INFORMATION FOR SEQ ID NO: 204: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 366 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 204: 

TGCACTGTAC TACACAGTGT ACGAATGCGT GAAGGTGATG TCTTCTGAGG TCGGTGCGTC 60 

TTTGTACGTG CACATCCCCT TCTGTGCGCA ACGCTGTGCT TACTGCGATT TTTACTCCCT 120 

GGTGCGTTCA ACCTATTTTA GGCCTCATCA GCCTTGTCCG CATTTTATCG ATCGGCTGCT 180 

ACAGGATGTG GCATTGCAGC GGGAGTGCTT TGGGGTCCAG GGGTGGCAGA CAGTGTATAT 240 

GGGTGGAGGT ACCCCTTCGC TATTGGCACC GCAGGACATT CGTCATTTTT GCGTACGTTA 300 

CGCGCCGCGC AGGnATTCCG ATTCAGGAGT TCACTCTTGA GGTGAnTCCT GAGGATGTGA 360 

CCGAAG 366 



(2) INFORMATION FOR SEQ ID NO: 205: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 566 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 205: 
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CATAGATGAC TTTGACGGTG CCTCCGAGGA TCAGGGTCTC GCCTGGCGTG CTGCGGGGAG 60 

CAAGTTCATC ACAAAGGGCT TCCCTATCCT CAAGTATTTC GAGGGCATGC CACAGGCGGT 120 

GCGCATGGCG GGCTCGTGGC AAGGGAAGGA CAAGGAAGCC CGGTTCATCG G AG T AGAGTG 180 

CAAGTTCAAT CGACAGGGGA ATAACTGGCT GGACCTAATT CCGACTAAGG GTGGTAGCGA 240 

TTACGAGATC CCCCTGCGTG GGGTGGTCAG TGGGTTCGAC GTGTGGGTGT GGGGTGCAGG 300 

TTATCAGTAC TCGCTCGAGG CTTTGGTTAG GGACTGCACG GGAcGAGTCC ACACCCTCCt 360 

AAtAgGCaAC CTCgAcTTCC aAGGGTGGAA rAAcCTTAgT GTTTCGGTTC CCACACACAT 420 

CCCACAGACG TCGCGCTATT TGGGGAGCGC GCAACAnCTG AATTTTGTCG GTTTCAGGAT 480 

CCGTACTAAC CCATCAGAnC GGGTGGATGA TTTCTACGTG TnCTTTGACC AGTTCAAGGC 540 

GCTTGCTAAC ATGCATATCG ACTTTT 566 
(2) INFORMATION FOR SEQ ID NO: 206: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1601 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 206: 
CAAAAGGCAG TTGACAGATG CAGAGGCGGA TTTTATC C AG AATTTGTGTA AAAATGATGA 60 

CCTTATCATG GCGCTGAATA TTTTTTCTAA CCGGGTGCAG TTGTTCTCCT GGACGTACGA 120 

CGACCTCATG TTCGTTGCTG GTGGGGGAAG TGAGTnAGTA CCGAGAAGTA GCTCTCTTTT 180 

CATAAAAATC TTTTCGGGAA GAGTGCCTTA TCAAGCCAAA GACGAATGCT TGTTGTTTAC 240 

CGCAGCGTTG AGGGTACTCA CTCCCGCGAA TATCTTTTTT AGGGGTTGCG TGGTCTTTCT 300 

GGTTATGCTC ACCCTCACCA TATCaAGCAT GGTCTTTTTG GTTATTGAGT ATCTGaGCGT 360 

GCGCCGATTC CaGTGCGcAC AGGGGGACGC GCGCCCTCGT CtGCTGGACC TCAAGGTAnG 420 

CGCGCTCTCG GACAGAnGAG GnCCCAGGTn TCGCCTGGTA CACCGCCTGA GTGTCCAGTG 480 

TCCGTCTTTT GAAACTCATG GGCAGGACCT TGCTTCTAAG GAGAGCGTGA GCGACCTCGn 540 
AAGGTTTACA TTTGTACAAC CATTCTGACG AGGAGCTTTT TTC C AT ACAT GACTCGGTGT 600 
TCCAGGAACA TACAGGGsGG GAGGGCCGGA TGTCTTCCTC TTCCGGTGAG GAAGGGTGCG 660 

GGCAGACGCT GCTTCCCGTT CATGAAAAAA AGAGCACCTA TGGTCTTTTC AATCCCTTrr 720 
CCGGrGTCGG TTGGCGGGCC TATCTGGAGG AAAGGCTTGA GGCAGAATTA GGACGCGCCA 780 
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CCGCTTCCGA ACAAGACCkA AsGCTAATGA TAGTACAGGT GGAGCACCCC GCGCACCAGA 840 

CAGCCGTTGC GGACGCAGCG AAAAAGCTCG TGGAGTTCTT CAAATTTCGG GATATGCTCT 900 

TTGAATTCGA GGGTAGTTGC TGCTTCGCGG GTATCGTACA AGACGCAAGC CTCGAAGAGg 960 

AnTGGTaCTC GCGAGGGaTA TACACAAGkA GCTGTGCGGC GCCATTGAGA GCGCACGCgT 1020 

CCTTATCGGC ATCGCGAaCG ckTACGTCCA GACTAACTAC CGcGGcCCAC TTaATTGAGG 1080 

ArGCGCACGC GTcGGTGAAG AGGGCGCGAG AAGACCCCGC ACACCCCATc ATCGcTTTCG 1140 

AGGCCCCCCA CCAGTGTGGG CGCCCGTATC GGTCTTCAGC TTATAGGGAT GTCCCGGACC 1200 

GGTCTACCTG CTAATAATGG CTTCCTTCTC GCCGCGCCGC GC AC TCTGC A GAAGCGCAGC 1260 

ACCCAACCTC CTGAAACTCT GGGAAGAAAC AGTTGCACCG GTGACCACAT CCACCATCTC 1320 

GGGATTACCC TTTTCAAGCA AAGCATCGGC GAGCTCTCTG AAGGCCTTTT CAGGACCTAT 1380 

GCCCGAGGAT GCATACATGA CCCGATGGTA GTCAGCGTCC TGGGACTTAA ACCGCCCTTC 1440 

TTTATGCTGA TAATCGTAAA CCACCTGCAC CATCTTGCCA CCATCAAAAG TAACCTCGAG 1500 

AAAGTCCTTC CAACCATTCT CATCAAAATC CTGATACGTC GCCCGGTACG TGCCATTCGG 1560 

GATAGAACTA AATGAACACG CCCCAAGCAA CACCGCAAGA C 1601 

(2) INFORMATION FOR SEQ ID NO: 207: 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 359 base pairs 

( B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 207: 

GTTGTnTCCT GTGCGACGCG nAAATACCTT CCCACACCTG CTGGTTAAAG GAC TTGTC AT 60 

CGATGTCCCC TGAATCGGTT ACCATTCCAA CGACGAATCA CCACCTTCTG CGCCCCCTGC 120 

GTTTC CCATC TGCGGCCTGT CGCTCTTAGA GCAGCCGATG AGCAGCATGG CGCAAAAAAn 180 

GCCCGCAAAn GCGCGTACCC ATTCTCTCTC ACAAGAATCC TCCCCCCTTT ATCGACAAAC 240 

ATGCGnAAAA TAAnGGGTCA CAGTGTAACC CAAGGGACAA nGAGGTnCAA AGAGTGGTGA 300 

GTTTTTGCGT GTGTGCAAGT GGCAAGGTGA AGGGGTACTA GACAGGCCCG GGGGGGGGT 359 
(2) INFORMATION FOR SEQ ID NO: 208: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 516 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
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<D) TOPOLOGY : linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 208: 

TCTTGCAGAA AGTGAGGCGG ACTTCCGTGC AGGAACCGAA ACACTGCAGG ACCTTTGTCT 60 

GTGAGTCTTT CCGGGCATAC GCACACAAAA GAGGAGGTGC AGGCTGAGGT GGAGGCAGGA 120 

GAGGTTCAGT GCCCAGCGCT TTGCCATCAG CCGGTACTCG TGCAGGAATG CCTGACGTTG 180 

TTGGAACCTG CAATTGTGGG TATCTCGCGA GGTGCAGACA GCACGAGAGA TGGGGCGGGG 240 

GCGTTTTTTA TTGACGGGAC ACTGGGGGAT GGGGGACACA CACAGGCGTT TTTGCACGCG 300 

TACCCTGcGC TCCGTGCGCT CGGTGTTGAA ATAGATCCGT CAATGCTCGC ACGGGCGCGA 360 

GCGCGCTTGA CGCCGTTTGG CAAGCGGCTT cGCTATGTCC TGGGGTGGTC TGATGTCTTT 420 

TTTGCCTCCG CATATGCATC AGCTCCTGCC TCTCCTGCAA CGGGAAGGAC TGCAGCTGGC 480 

GCCGCAGTGT GCCGGGTGCG TATCCGGCGC CGCAGA 516 
(2) INFORMATION FOR SEQ ID NO: 209: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 541 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 209: 

C C AC AnAnAG GACCAAAAAG GACCAATGGC CCGGGGCCGG CCCCAACCGG CCACCTTCCA 60 

GGAACCGGCG GTGGCGGGTT GGCCCCCCGC GGTTTTTTAA AAAAAAAACC GGCCGGCCCT 120 

TTTTTTAAAA AACCTTACCA AAACCGGCGG TTTTCCAACC CAGGCCAACC CGGGGGGGGA 180 

AACTTGGCCC TTTACGGGGG AGAGGGTGTT AACACCCGGG CTTATTGGGC GCGATGGAGG 240 

TACGCACACG TGCAGCGTTT TACTCCCGAC GGCTGGCTCC GTACAGGGGA CGCTTTGGGA 300 

CAAAGACAGA AACCGGTAAT CTCCTCCCCT GGCAGCAGCT CGTGCCATAT GCAACTCGGT 360 

GCGCGCGGAG AAGCGGTGTA CGCAGAnGAT CTTGTTTGTG TGCTTATGCA AnATCCGTGG 420 

CGTGGTGGCA GCACACGTGC GGCGTTAGAC ACGCAAnGGG CAAGCGCACT GCGCCGTATG 480 

GGTAAAACAA GGAGCCGAAC GAATACGGGC ACCCTTCAGA TAGTGTGCTT TTTGCGCACC 540 

C 541 
(2) INFORMATION FOR SEQ ID NO: 210: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 329 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 210: 
GGTGACCGTC ACCTACCGCA CCGTACATGG CGCATGCTCT TTTTGCCGCA ACTACATCAT 
GGCTAGAGTG ACTACCCGGA CCGGTCATAA CCCCGATGTG ATAATCCTCC ACCGCTTTCG 
TGCCCCCGGT ATTTTGTGGA GCACGCGCGC CCCCACAGCG ACACAACAGT GCACCGACAC 
ATACCACCCA TACCTTCAAT AGAAACTGAC TTTTCATAGT CTCCCCTTAA CGATCTGCAC 
ACACCATCTC TCCAAAACGC TTAGGCGTAT GGTCCACCCC CCCCCACGGT GGGAGGAnAA 
nAATTTTCCC AAAATTTTTG GGTTGGGTT 
(2) INFORMATION FOR SEQ ID NO: 211: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 485 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 211: 
TCGTTTGTAT CGTTGAGCGT GTTATCGCTT CAAGGGACAA AGGTGCCATA CTCATTTTTC 
TGCCAGGGGA GCGTTCTATT AAAAATTGTA TTACCCGTCT TTCCCATGAA CGTTGGTTCC 
GCAAGCTCTT TCTTTTGCCC CTCTATGGAA GATTGAGTAA AGAAGAACAA GAGCAAGTTT 
TTAACCGCGC GCCATTTGGA AAAAGAAAAG TCGTCATCGC AACGAATATT GCAGAAACAT 
CCATCACCAT TGACGATGTA ACTACCGTCA TTGACTCTGG TTTTGGGGGG TTnGGGAAAA 
AGGGTTTTTn AAAAAAAATT TTTCCCGGTT nAAATTTAAA AAACCCCCCC GGGGCCTTTT 
TTTTCCCCCC TTAAATTnAA ACCTTGGGCC CAAAAAGGTT TTTTTTGGGG GGAAAAACCG 
GGAAAAAAAA AnCTTTCCCC CnTTTAAATT TTTTTTCCCT TCCCAAGGGG GGCCTTTTTT 
CCCGG 

<2) INFORMATION FOR SEQ ID NO: 212: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 808 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
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(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 212: 
TTGCATCGCG CGTCTGCAAG CGCGTGnACC AGC TTAnCCA TTGAAAAAGT ACACAAACCA 
AAAACAGGAG GGATTACGCC GAGCACCATA GAGCACGTAt TGGCGGAGAG GCATACACCG 
TGGTGCTCGA CCGCTCGTGG ACGTATGAAA AGGGCTGGAC CATTGCGCTG TTCCCCGATG 
ATAAGGCGGA TGAGGCTGGA GCGTATGTAA AAyCCCTTTC GGTTTCAAAA AACAAAGAGG 
GAAACTGGAT GGTTGCCATT CCCAACGCGG CGCTCAAAAC CGGGTCGTAC ACGCTGCGCG 
CAATAACGCC GCGGAATATT TACGCAGAGG TGCGGGGGAT ACTGCATGTA GTGCTCTGGC 
GCAGGCCCAT CTTTTTCTAT TATGACCTGA GCGTGGGATA CGCGCCGGTG TATCGGCCGC 
AGaCCACGCC GCGACAATCA ACGGTGTTTC TGACTTTTTC AAAATCTGTT CTCCTATCGG 
GTTTGTCGGC ACGTTTGAGA TGTGCTTTTT TAAGCGCAAC AGCAGCACCA TCAGCGCTGG 
CTTTAACGCG CAAATGCACT CCGATTCAAA ACAAGTGGAC GTGAAGCTCG ATGGAAACTT 
TGCGTATCTA TACGAACTTT ACCCGCGCAT CGAGGTAGGC GGCATGCTTG GGTTGGGGTA 
CTCGCTGCCA TTCGGACAGC GCAAGGAAGA CGACAGCATG TACTCCTACG TGACAGGAAC 
GATGAAGTAT TTTTTACTAA TAGCATTACC TGCGCGTTCA ACAGCAGCAC ATGTTGACCG 
TAAAGCCGAG TTTCACAGGA GTGAGCCT 
(2) INFORMATION FOR SEQ ID NO: 213: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 312 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
<D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 213: 
CTGCGCCCGC AGGnAAAAAA GTGTGTGCAG CAATCCAGTC TCTTCCATTA GGCGGAnGCA 
ATACTCGGAC GGGGAGTACA AGCTCTTAGA AAGTTCTCAC GCACCCGCTC AGAGAATACG 
TGCGTATGAG CGCGnAnATA AATTGCTTCA CGCGTGCGCG CTTCGATGGA AAAAGAnAAC 
TGCGCTGCAA AACGCACAGC ACGCAAAGGA CGCAACGCAT CTTCAGAAAA TCTTGCCTGT 
GCATCCCCCA ACGCTTAAGA TAAGAACCGT TACGCAAATC GGAGTAnCCG CCACATTACG 
TCGATGATTT CC 
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(2) INFORMATION FOR SEQ ID NO: 214: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 432 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 214: 

CTGCTCTATG TAGCGTTTTG CCTTTCGCCC TGTTTTCATT TCCACGATTT TGCCGTGGTG 60 

GAGGACGAnG GAACCGAGGA TCTGCACGTC ATAGGGACGC TCGCCTAAAA CACGACGAGC 120 

TGCCTCGCGC GCAAnCATAT ATTGCCTGAG GnAAAAAAGC GTCAAGCGCT TTCTTCCTGC 180 

AGCGGCACGC GCTCTTTAAA CTCAGCTGTT TTTTTGTTTG AACTCAGACT CCTGGAGAGG 240 

AAGTTACCCA GGACTTCCTG GGTCGTTTGT ACGGTCATTT CAAAAGAGGC AGGTAGATTT 300 

TTTTCAGAnC GCGCTCGTGT CTGGGTAGCC AAAGnAGTAG CCTGTAGTGT CAGTGTCGTA 360 

CGTAGTCATG GTAGTTACTG TTAGCGGTTC TGTAACCAnA AAAACAAGGG TAGACCGTCA 420 

CGGTCACCCC CT 432 



(2) INFORMATION FOR SEQ ID NO: 215: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 631 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 215: 

AGTGGGAATA AGAGAGGGAG GAGACATACA GTTCAGCACA TGCTCACTCG ATCTTACTCC 60 

TAATGATAGA GCTAGAAACG TGCCATGGGA CTCCAAATCC CCTTAAATTG GCAAGTACCA 120 

ATGCCATCTT AACTACTTAA ACTGATCAGT TTCAGTTCAT AATTGATCAA AAAGATAGGA 180 

TTAAGTGTCA AAAGGATCAC ATGAATAAAA CCAGTGTCTT CTAATAATAA CTGATAGAAT 240 

TAAAAAGGAG AGAATGACTC AACATGGGAA GTGGGATACA CAGCAGACTC AGAATGACAA 300 

ATGTCCTAAA CAGCACTCTG GCCACACAAT CAGCTCTTAA GGCATTCGGA TCTGGCTAAA 360 

AAGCCAATGA AAGTTTCTCA GGCATGGAAA GCCAAGAACT GTGGCAAAAA ATGGCCTAAA 420 

TGAAACATCT GTGTGAGTGA GATCCCAGCA GAAAGAACGG GCCATCAAGG AAAGAGGTAC 480 

CTTTCCCTGA AGGGAGGAGA GAACTTCCAC ATTGACTATG GCCTTGTCTA AATAAGGAGT 540 
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TGGCGAACTC AAGAGGCTfC CATAAGCTTG GCAACTCATG ACAAGAGCCT TGGGTGATTA 
CTGATGCCAT AAACAAGAGT GTCAATTTGT T 
(2) INFORMATION FOR SEQ ID NO: 216: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 514 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 216: 
ACATAGTACA ACACATTAAG GACAGAGATC CTACATGGGG AGTAAGTGCA CAGTGACTCC 
CTTGTTGATT TAACAACTGA CACTCTTATT TATGACGTCA GTAATCACCT GAGGCTCTTG 
TCATGAGCCA AGGCTATGGA ATTCTCTTGA GTTCACAAAC TCTGACCTTA TTTAGACAAG 
GTCATAGTCA AAGTGGAAGT TCTCTCCTCC CTTCAGAGAA AGGTACTGCC TTCTTTGATG 
GCCCATTCTT TCCACTGGGA TCTCACTCAC AGAGATCTTT CATTTAGGTC ATTTTTTGCC 
ACAGTGTCTT GGCTTTCCGT GCCTGAGAAA TTTTCATGGT TTTTTTTTTA GCCAGATCCG 
AATGCCTTAA GGGCTGATTC TGAGGCCAAA GTGCTATTTA GGGCATCTGC CATTCTATGA 
GTCTGGCTGT ATATCCTGGC TTCCCATGTT GGATTGTTCT CTCCTTTTTA ATTCTATCAG 
TTATTATTAG CAGACACTGG TCTTATTTAC ATGA 
(2) INFORMATION FOR SEQ ID NO: 217: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 483 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 217: 
ATCCCTGTTA AATATAAGAG TGGGAATAAG AGAGGGAGGA GACGTACAGT TCAGCACATC 
TTCAATTGGA CTTAGCCATA ATGGTAGAGT TAGAAATGGG CCAGGGGATT CCAATTCAAT 
CCCATCAAGG TGGCATGTAC CAATGCCATC TCACTAGTCA AAGTGATCAG TTTCAGTTCA 
TAATTGATCA TAATGATAGG ATTAAGTGTC AAAGGGATCA CATAAACAAG ACTAGTGTCT 
GCTAATACTA ACTGACAGAA TTAAAAAGGA GAAAATGATC CAACATGGGA GTTGAGATAC 
ATAGCAGACT CATAGAATGG CAGATGTGCT AAACAGCACT CTGGGCCTCA GAATCATCCC 
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TTAAGGCATG CGGATCAGGC TAAAAAGCCC ATGAGAGTAT TTTAGGCnGG AAAGCCAAGA 420 

CACTCTGGCA AAAAACAAAA AAACAAAAAA AAAACAAAAC AAAAACACAC ACACACACAC 480 

ACA 483 
(2) INFORMATION FOR SEQ ID NO: 218: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 527 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 218: 

ATTTTCATTT AGGGTGGTGT TTTTTTTTTT TTTTTTTGCC ACAGTGTCTT GGCTTTCCAT 60 

GCCTGCAAAA CTCTCATGGG CTTTTTAGCC AGATCTGAAT GCCTTCAGGG CTGATTCTGA 120 

GGCCAGAATG CTGTTTGGGG CATTGGCCAT TCTATGAGTT TGCTGTGTAT GCTGCTTCCC 180 

ATGTAGGATC ATTCTCTCTT TTTTAATTTT ATCAACTGTT ATTTGCAGAC ACTGGTCTTA 240 

TTTATGTAAT CCCTTTGACA CTTAATCCTA TCTTTTTGAT CAATTATGAA CTTAAACTGA 300 

TCACTTTAAC AAGTAAGATG GCATTGGTAC ATGCCACCTT AATGGGATTG AATTGGGATC 360 

CCCTGGCACA TTTCTAGCTC TACCATTAGG GGTGAGTCCG AGTGAGCATT TTCTGAACTG 420 

TACATCTCTT CCTTCTCTTA TTCCCACTCT TATATTAACA GGGATCACTT TTCAGTTAAA 480 

TTTAAATGAC TAAGAATAAT TGTGTGTTAA TTAAAGAGTT CAACCAA 527 



(2) INFORMATION FOR SEQ ID NO: 219: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 460 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 219: 

CTGAGAAGGG AGAAGCAGCT TCTACACAGC TGCCTCCAGT TCAACCAATA AACTGTAGGA 60 

CCTGCTCCTG ATTGGAGGAG AGCAGCGTAC TCGGCGTGTG GGTAACAGAG TTGGGATTGG 120 

TGGAAGAGGA CTATAAAGGA GGAGAGAGAC AATATGCACC AGGAACATCT AAGGGGAACA 180 

TCTGGGGGAA CACCTGTGCA GCCCCCGAGA GAGCCGGCCG GCGGTGTGCC GCTTCCCCCG 240 

CGGAAGTGGG GAAAGTGGCT AGGGGGAACC GCCCTTCCAC GGAGGTGGAA GGGTTGGTAG 300 
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CCAACCCGGG AAGAACCAGC AGCAAACCCG GGGAGGGCCG AGCAGACGAA AGAACAACGC 360 

AGGTCCTGTG TTGTTCCTCC ACGAAGACGG GGAGCGACAC AGTATTCTGT GATAGAGGAC 420 

TTTGTAGCAC CATCAGTTAT TGAATTCAAG TCACTTCCAG 460 
(2) INFORMATION FOR SEQ ID NO: 220: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 327 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 220: 

CCGTCTTCGT GGAGGAACGA CACAGGACCC TGCGCTGTTC TTTCTTCTGC TCGGCCCTCC 60 

CCGGGTTTGC TGCTGGTTCT TCCTGGGTTG GCTGCTGGTC CTTCCCACCT CCGTGGAAGG 120 

GCGGTTCCCC CTGGCCACTT TCCCCACTTC CGCAAGGGAG CGGCACACCG CCGGCCGGCT 180 

CTCTCGGGGG CTGCACAGGT GTTCCTTCAG ATAGATGTTC CCCTTAGATG TTCCTCGTGC 240 

ATGCCGTCTC TCTCCTCCTT TATAGTCCTC CTCCGCCAAT CCTAAC TCGG CTGCCCACAC 300 

GCCGAGTATG CTGCTCTCCT CCAATCA 327 



(2) INFORMATION FOR SEQ ID NO: 221: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 474 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 221: 

nGGAAATTTT TCCAAAAAAA GGCCCTTGGG TTAACCCTTA ACCCGGAAAT TCCnTTTTCC 60 

CCCGGCCAAC CCGGCCGGGG AAACCGGTTA AATTTTGGGG GCCGGCCAnG TTTCCAAAAA 120 

AACCAAGGTT TTAACCTTCC GGAnAAAACn TTAACCCCAA CATTCCGnGG GGCTGGGGAA 180 

CCAACGGCTT CnAnAGGGGG AnCCACTGTG GATTCATGGG TTCGGCAGGC AACGTTGTCT 240 

TGnATAGGAT TGCGCACAAT TGGGCTGCCA AnGCATCGAG CATACTGCTT GCGTTTTTGC 300 

TCGTGCAATT TTACAGCGGC AGTCTGCTGG AACGGCGCGC CATTTCTGTT CCGTTAGTTG 360 

TGAGAAATGA AGGGCGCACT AACTCCTGCG CTTCGCTTTC CTCAAAAGGT GACGGTGCTG 420 

GATGCGCGCT TTCACGTGAT ACGCTCGGCG CACTGCCGGG ATCTGACATT GTCC 474 
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(2) INFORMATION FOR SEQ ID NO: 222: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 580 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 222: 

GCGTCAGTCG CGTCAGATCC CCACGCGTCC GTGCCCGTTG CACGCGTGGA CACTGCCAAA 60 

GCGCAGTGGG CTGCTGGATA CCGCTGCTGC GGGGCTATAT CCGCGTAACA GTGTAAGCCA 120 

ACGGTGTAGG AAAGTGGCGC ATTTTGCGTG TATGCTACGG TGTCCGAGAA ATTACCAAAA 180 

CGGTGGAGGT GTATTATGAT TATCCTCACG CTAAACTGCG GCAGTTCATC TGTAAAATAC 240 

CAGGTGTATA ACTGGACAGA GCGTGCGGTG ATTGCCGTTC GTGGCCCGGn TTTGGGGGGT 300 

TTAAAGGnAA AGnCCGGnTT GGGTTTTTAA ACCTTTCCCA AGGGGGCCCA AGGGGGAATT 3 60 

TCCTTGGGnT TTTTAATTTC CCAACCGGGC CCAATTGGnn AAGGGGGTTG GGCCCnAACC 420 

GGGGGGCCCC CGGGnAAAGG nnAGGGnAAA AAACCCCCAA ACCGGGTTTT TTCCCGGGGG 480 

GGGGGGGAAA AAAAAAAAGG GTTTTCCCCC CCCCTTTGGG GnCCCCCCCC CCCTTTTAAA 540 

ACCCCCCCCC CAAATTTTAA AACCCCCCGG GAAAAAAGAn 580 



<2) INFORMATION FOR SEQ ID NO: 223: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 692 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 223: 

AnTAnGCTTG GTCCTAAGGG TCGCAATGTC TTGCTTGAAA AAGGGTACGG GGCTCCCACA 60 

GTCACGAAGG ATGGGGTTTC CGTTGCGAAA GAGGTTGAGC TCGAAGATCC GTTCGAGAAT 120 

ATGGGTGCAC AGCTTTTAAA AGAGGTGGCT ACGAAGACGA ACGACGTAGC TGGGGATGGC 180 

ACAACTACTG CGACGGTATT GGCGTATTCG ATGGTGCGTG AGGGTCTGAA GGCGGTTGCT 240 

GCCGGTATGA CGCCCCTTGA GTTGAAGCGT GGTATGGATA AGGCAGTTGC GATTGCAGTC 300 

GATGACATTA AGCAAAATTC CAAGGGTATA AAGAGCAATG AAGAAGTCGC TCATGTAGCG 360 

TCAGTATCTG CGAATAACGA CAAAGAGATT GGAAGGATTC TGGCAAGCGC AATCGAGAAG 420 
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GTGGGGAATG ACGGGGTCAT TGACGTTGAC GAAGCCCAGA CAATGGAAAC GGTGACGGAA 
TTCGTTGAAG GGATGCAGTT TGATCGTGGG TACATCTCGT CCTACTTCGT CACTGACCGA 
GATAGGATGG AAACGGTGTA TTGAAAATCC TTACATCCTT ATCCTACGAT AAGTCCATCT 
CGACTATGAA GGATTTGCTT CCGCTACTCG AGAAAATTGC GCAAACAGGT CGACCGCTGC 
TTCATCATAG CTGAnGATGT CGnAAGGCGA AA 
(2) INFORMATION FOR SEQ ID NO: 224: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1000 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 224: 
AGCGGAGACT TACCAACGAG CCGATATGAG CCGCGTATGA CTTGCCATTT ATCGTATGCT 
TTTGCGTATT CGCTTTTTCT GCAGCGACGG TTTGCTGGTT GAACACAAGA ACTTGCTGGA 
TTGCAACCGC GGTCGACTGC GCAACCCGGG CCTGTACCTG CTGGGTGATA ACGTTCTGCA 
CCACGCCAGT AATCTGCCCG AT AATGGGC C TCGCGGCGGC CGCCCCGGCA CTAAGGAAGA 
AGAGCTTGGT TACATTTGGC AATTGCGCTG CGATTGCGGT AAGTGAAGGC AAGTCAGGTT 
GCCGAACAGG ACTTTGCCGG ACGTTCACCG GAACGTCACG TcAATATTCA CTACTTGTGT 
TCCATTGGTG CGTTTAATCT GATCGTGTGC CTGCTTCACC GCTCCTTCAG CGATCTGAGT 
AATAACTGAT CTTGCCACCT GCACCGAGTT ATCTATGGTG CTACCAACCG TATCAGCCGC 
CTGTTTAGCC TGTTCCTGCG CAcGTGcGTA AAAATTGCCA GTGCAACCTC CTGTGCACGC 
TCGCGTGTCC TTTCGGTCCT CATCGCCGCG GTAGCCTCAC TCTGGTGTTG GTTACCGGCG 
TCGAGGGCGA AGGAGAAGCG GAAGCCGGCG CCTGGTTCGA GGGTGAGTCG GCCCCCTACA 
TTCCACAGCA GTTTATCCTT GTTCTGATTG TTTGCGTCCT TCTGTGCACC GATGAGGTAT 
CCGTCTTCTA GCGTAACATT GCTGGCAAGC TCTACCGTGC ACAGAGGGTG TCCTGCACGC 
GCATACATTA GCTTCAAGTC TGCCCCAAAG CCATACTTAC TGTGCGTGGG GTCAGTACTA 
TCCCAGGCAC CGTTAGAGGC AAAGGAGAGA AACCCCACAT CAAGGCTGAC CCCACTGCCC 
CCAATGTCCT GTGCCCGATA ACCAACCTTG nCGGCTAAAA CCCCAAAACC CGGGGCATAC 
TGTAACGCAT CCTCCTGGTA ATGCGGTGGT CAAnCCAAGG 
(2) INFORMATION FOR SEQ ID NO: 225: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 842 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 225: 
TCAAAAATCT GAATATCCTA CCATATGACC CAACCATCCC ACCCCTGGGA ATTTACCTAA 
GGGAAATTAA ATCAATAAAT AAAAGAGTTA TCTGAACCCC CAGGTTTATT GCAGCTCAAT 
TCACATAGCT AAGACATGGA ATCAACCTAA ATGCCCATCA ACCAATGACT GGATAAAGAA 
ACTATGGGGT AnGTACACTA TGGAATACTA CATGGGCAGT AAAAAAAAAA ATTGAAATCT 
GGTCATTGGA CAACAAAATG GATGAATCTG GAAAACAGCA TAATTAGTGA ATTAAACCAG 
TCCCAAAGGG ACAAATACCA TATGTTCTCC TTGATCTGTG AGAACTAATG GAGTACCTAA 
AAGGAAATCT GTAGAAGTGA AATTGACACT TTGAGAAGGG ATGACTTGAG CTGCCCTTGT 
CTTGACTTTC AAGGAACAGT TTTTCTTTTT TCATTTTTTT TCTTCAkGCT ATTTGCTGAA 
CTCTTTAGTT AACATAGAGT TAATCATATA AAGTC A t TGA GGATGGATCT CAGTAAAAAA 
TAAGAGTGGG AATAAGAAAG GGAGGAGGAA GTTTTGTAAC TGTAAAGCTA TATAGTTATA 
CATACATTCC TATGTACTTA CTTCTAAGGC ACAGTTTAAA AACTTGTCAT GAGATCCCAA 
ATCTCATTAA GCTGGGTGGA AAAATGCCAT CTTAAGTGTT AAAGTGATCA TATTAGTGTT 
AAAGTGAACA TATAGATACG TTTAAGTGTT AAAGTAAACA TATAAATAGG TTTAAGTGTC 
TGGTAATAAT AATAGATATA ATTAAAAAGG AGAGAATTTT CCAACTTGGG AAATAGTCCA 
CA 

(2) INFORMATION FOR SEQ ID NO: 226: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 560 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 226: 
TACAAATTGA GTGTCACTGT GATTTTCAGG TTAATTCATT CAGAATTCCA CAGGCAGTCA 
GCTGGAGGGG TGGATGAATC CGGGAACAGC CGGTTTCCTT GCGACTAAAT AAAGCCTTAT 
ATCATAATGG ATTCTTGCAT TTCTTTGGCT TTTTTCTCCG TTTTCTTATT TATCATGGTT 
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TATAATGAAT TTTCTGATGT GAGCTTTTGT TTTGTGGCAG GCATATTGAG TTCAGGGCAT 240 

AAGAGGAAAT TTCTTTTTCC ATCAGGGCAC AGCTGTTGGC TTGCTCCAAT TCTGGTTTGC 300 

TGTCATTGTT TTCGGTGCCT CAG CCGATTG TGGTTTCTTG ATCAAAAATT TCCJTAAATGT 360 

TTTCTGGGTC TGAGTTGGTT TTGCCTTCTT CCTTCCCAAA TTTTGTATTC GGATGCAATA 420 

CAAACAAGTG TTTGGACAGA GTATGCATTA CGAAATAGAA ACACAACATT AAGGCGTTTG 4 80 

AAGCTTAGAC ATCTACAGGT AGCACGAGCA AGGTGAGTTT TTGTGTTTTG GAAAATCAAA 540 

TGGAACACTT TAGCTGAGGT 560 

(2) INFORMATION FOR SEQ ID NO: 227: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 406 base pairs 
<B) TYPE: nucleic acid 
{ C ) STRANDEDNES S : doubl e 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 227: 

TGnAAACTTC AGTTTGGGAG GCAATTAATA TCTGTCACCT CAGCCAAAGT GTTCCATTTG 60 

ATTTTCAAAA ACAAAAAAAC TCACCTTGGC TCGTGTTACC TGTAGATGTA TAAGCTTCAA 120 

ACGCATATTT GCTGTGTTTT TTATTTCATA ATGTCTACTC TGTCCTCACT CTTATTTGTA 180 

TCACATCAGA ATACAAAATT TGGGAAGGAA GAAGGCAAAA CCAACTCAGA TCCAGAAAAT 240 

TTTTAGGAAA ACACCACTCA AGAATCCACA AATCGCCTGA GGCACCCAAA CCAATGAAAG 300 

CAAGCCAAAA TTGGAGCAAG CCAACAGCTG TGCCCTGAAT GnAAAAAAAA CTTCCTCTTG 360 

TGCCCTGnAC TCAATATGCC AGCCACAAAA CAAAAGTTCA CAGCCG 406 



(2) INFORMATION FOR SEQ ID NO: 228: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1425 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNES S : double 
<D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 228: 

GAACTTGCAA AAAGCGTGGT AACCnGTATT GCACGGGTTA AGGATGATAT GTGTnAACTG 60 

CTTGATAGAG GCTTCGTGCG CATTCTTGGC CAAGCAGCAG GCAGAGTCGC CACTCCGGGG 120 

AAAAAGCTTT TGCTTCTCAG GCTCCCTGCA GAAATGGAGA TCGCGCGnCT ATACACCGTA 180 
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TACGCGCGCT CGGnGGCGTT nTGAGAACGT CGGGGATCCT CTAkAGTCGA CCTGCAGGCA 240 

TGCAAkCTTG kCACTGGCCG tCGTTTTACA ACGTCGTGAC TGGGAAAACC CTGGCGTTAC 300 

CCAACTTAAT CGCCTTGCAG CACATCCCCC TTTCGCCAGC TGGCGTAATA GCGAAGAGGC 360 

CCGCACCGAT CGCsskTCCC AACAGTTGCG CAcCTGAATG GCGAATGGCG CCTGATGCGG 420 

TATTTTCTCC TTACGCATCT GTGCGGTATT TCACACCGCA TATGGTGcAC TCTCAGTACA 480 

ATCTGCTCTG ATGCCGCATA GTTAAGCCAG CCCCGACACC CGCCAACACC CGCTGACGCG 540 

CCCTGACGGG CTTGTCTGCT CCCGGCATCC GCTTACAGAC AAGCTGTGAC CGTCTCCGGG 600 

AGCTGCATGT GTCAGAGGTT TTCACCGTCA TCACCGAAAC GCGCGAGACG AAAGGGCCTC 660 

GTGATACGCC TATTTTTATA GGTTAATGTC ATGATAATAA TGGTTTCTTA GACGTCAGGT 720 

GGCACTTTTC GGGGAAATGT GCGCGGAACC CCTATTTGTT TATTTTTCTA AATACATTCA 780 

AATATGTATC CGCTCATGAG ACAATAACCC TGATAAATGC TTCAATAATA TTGAAAAAGG 840 

AAGAGTATGA GTATTCAACA TTTCCGTGTC GCCCTTATTC CCTTTTTTGC GGCATTTTGC 900 

CTTCCTGTTT TTGCTCACCC AGAAACGCTG GTGAAAGTAA AAGATGCTGA AGATCAGTTG 960 

GGTGCACGAG TGGGTTACAT CGAACTGGAT CTCAACAGCG GTAAGATCCT TGAGAGTTTT 1020 

CGCCCCGAAG AACGTTTTCC AATGATGAGC ACTTTTAAAG TTCTGC TATG TGGCGCGGTA 1080 

TTATCCCGTA TTGACGCCGG GCAAGAGCAA CTCGGTCGCC GCATACACTA TTCTCAGAAT 1140 

GACTTGGTTG AGTACTCACC AGTCACAGAA AAGCATCTTA CGGATGGCAT GACAGTAAGA 1200 

GAATTATGCA GTGCTGCCAT AACCATGAGT GATAACACTG CGGCCAACTT ACTTCTGACA 1260 

ACGATCGGAG GACCGAAGGA GCTAACCGCT TTTTTGCACA ACATGGGGGA TCATGTAACT 1320 

CGCCTTGATC GTTGGGAACC GGAGCTGAAT GAAGC CATAC CAAACGACGA GCGTGACACC 1380 

ACGATGCCTG TAGCAATGGC AACAACGTTG CGCAAACTAT TAACT 1425 
(2) INFORMATION FOR SEQ ID NO: 229: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 362 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 229: 

GGGGCTTTTT AACCATATCT GAATGCCTTA AGGGCTGATT CTGAGGCCAG AGTGCTATCT 60 

AGGATATCTG CCATTCTGTA AGTCTGCTGT GTATCCTGCT TCCCATGTTG GATCATTCTC 120 
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TCCTTTTTAA TTCTATCAGT T AG T ATTAGC AGACACTAGT CTTGTTTATG TGATCTCTTT 
GACACTTAAT CCTATCATTA TGATCAATTA TGAACTGCAA CTGATCACTT TAACTAGTGA 
GATGGCATTG GTGCATGCTC AATTGGACTT ACCCCTAATG ATAGAGTTAG AAATGTGCCA 
GGGAATTCCA ATTCAATCCC ATCAAGGATT TTATTTAATT TAATTTAATT TTATTTACTT 
AT 

(2) INFORMATION FOR SEQ ID NO: 230: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 554 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 230: 
CGGATCCCCG GTCAATTGTG GAGACAnTAT TATCAATTTA CCAAGAGAAA GCCGAGGTGG 
AAAGAGTTCA GAATTGAATT TGCCCACGGC nAATATnGTT AAAAAGTCTA GATTAAAGGC 
CAACCAACCA AAGTTAAATA TAGGCATTAG GATCTGGCTG AAGAGCCCAT GAAATTATTT 
TAGGCATGGA AAGTCAAGAC ACTCTCAAAA AAAAAAAAAA AAACTAnATG AAAGATCTCT 
GTGATTGAGA TCCCAGTGGA AAnAATGGGC CATCAAAGAA nGGTACTTTT CTCTTAAGGG 
nGGAGAGAAC TTCCACTTTG ACTATGACAT TGTCTAAATA AGATTGnAGT CAACAAACTC 
AAAAGGTTTC CATAGCCTTG GCAACTCATG ACnAGAGCCT AGGGAGATTT CTGACGCCAT 
AAACAAGAGT GTCAnTTTGT TAAGTCAACA ACAGGAGTCG CTGTGGCACT TACTCCTCAT 
GTAGGATCTC TATTCnTAAT GTGTTGTACA AGGnGAATTA ATGCTATAAC TAGTACTCAA 
ACAGTATTTT TCAC 

(2) INFORMATION FOR SEQ ID NO: 231: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 541 base pairs 

(B) TYPE: nucleic acid 
<C> STRANDEDNESS: double 
(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 231: 
ACCCAACCTC AGCACTAACC TTGACGAGTC ATTTCTTTGA TTTGGTCATT GGTAAAATAC 
TGACCAnCCG TTTGAGCTTG AGTAAGCATT TGGCGCATAA TCTCGGAAAC CTGTCTGTTG 
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CTTGGAAAGA TTGGTGTTTT CCATAATAGA CGCAACGCGA GCAGTAGACT CCTTCTGTTG 
ATAAGCAAGC ATCTCATTTT GTGCATATAC CTGGTCTTTC GTATTCTGGC GTGAAGTCGC 
CGACTGAATG CCAGCAATCT CTTTTTGAGT CTCATTTTGC ATCTCGGCAA TCTCTTTCTG 
ATTGTCCAGT TGCATTTTAG TAAGCTCTTT nTGATTCTCA AATCCGGCGT CGTCAAAAAC 
AGGAAGCCTG GGTAACCCAG GTAGTGCAAC AGGCGACGCA GACAGTAACG GCTGGAGTTC 
GAAGCGCGCT GGAATCTCGG GGGACTACGT ACATAAACGC GCTAGAGGCA GTTCAGCCTA 
ATCCTGCTAA ACCTACCGGT AAGGnTGTGC AAAATCTTCA CACCCCGCAG GAAGTCCGCC 



(2) INFORMATION FOR SEQ ID NO: 232: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 628 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 232: 
AAATAAAAAG AATCATCAGG AATTCCTACA AAGATCTATA TGCCAACAAA TTGGGAAACC 
TATAAGAAAT GGATAGATTT C TGAACAC AT ATAATCTACC CAAGCTGAAT CATGAAGACA 
TAGAAAGTCT AAACAGATCA ATAACCAAGA CAGAGTAATA TCAGTAAGAA AAACCCTCCC 
GCTTCCCATG TTGGATCGTT CTCTCCCTTT TTAATTCTAC AGTTAGTATT AGCAGACACT 
AGTCTTGTTT ATGTGATCCC TTTGACTCTT AGACCTATCA TTACGATCCA ACATGGGAAG 
CAAGATACAC AGCAGACTCA TAGAATGGCA GATGTCCTAA ACAGCACTCT GGCCTCAGAA 
TCAGCCCTTA AGGCATTCAG ATCTGGCTCA AGAGCCTATG AGAGTATTTT AGGCATGGAA 
AGCCAAGACA CTCTGGCAAA AAAAAAAGGG GGGGGGGCAA ATGAAAGATC TCTGTGAGTG 
AGATCCCAGT GGAAAGAAAA AGAACGGGCC ATCAAAGAAG GAGGTACCTT TCTCCGAAGG 
AGGAGAGAAC TTCCACTTTG ACTATGGCCT TGTCGAAATA AGATTAGAAT CGGCAAACTC 
AAAAGGCTTC CATAGTCTTG GCAACTCA 
(2) INFORMATION FOR SEQ ID NO: 233: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 614 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 233: 
TAATGGAAGT AGAGAATGGG AGGGAACTGG TAGGAAGGGA GAGGTGTGTG AGGCTGGGAA 
ACCACTACAA ACTTAATAAA AAATCAAATG CTGAGGTAGG ATGTCAACTA CGTAAAAGAA 
AATAGACCAT AGAATAATAA ATGAAAATAT ACCAAAAGCA CTTAAACATT TTCCTACTGT 
TGGGTAAATA GGTGAATTAC AGTTTTTAGC TTCAGGCAAT AAAAGAAAAT CTTTGTGGTA 
AGATTTCAAG TTTTTAAAGA AGTTTATCTT CACAATTGAT CACACTGATA GGTCAAAGAG 
TCAAAGGGAT CACACAAACA AGACTAGTGT CTGCTAATAC TAACTGATAG AATCAAAAAG 
GGAGAGAACA ATCCAACATG GGAAGTGGGA TACACAGCAG AC TC AT AG AA TGGCAGATGT 
CCTAAACAGC ACTCTGGCCT CAGAATCAGC CCTTAAGGCA CTCGGATCTG GCTGAAGAGC 
CCATGAGAGT ATTTTAGGCA TGGAAAGCCA AGACACTCTG GCAAAAAAAA GGnCCTAAAT 
GAAAGTTCTT CTCTGTGAGA TCCCAGTGAG TGAGATCCCA GTGGAAAGAA CAGGTCTTCC 
AAAAAGGAGG TACC 

(2) INFORMATION FOR SEQ ID NO: 234: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 301 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 234: 
GCCnGCCAAC CCAAGGnTTT TTTACCCGGT TTTTTGGCCA CCAAATTCCC CTTCCCCGGC 
CACCTCCAAG GGCCAAGGAA ACCATTCCCC CGGCACCCCA ACCGGACCAC TGCACTGGCA 
AGTnAAGnCG GCATGGGCAG CAGTCGGTGC AGGACCTGCA GGATCGCTCA TTCCTGGCGC 
TCCTCTCAGT GCGGGAGTCG GCTCTCGCGG CGnTGGGGAG CGTTGCCTGC GCCAGTAGAG 
CCGCTGCTCC GCCAGGCGGG ATGACGCATT GGGTGCGCTT GCAAGACTGT GGGCAGCGTG 
C 

(2) INFORMATION FOR SEQ ID NO: 235: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 240 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 235: 

AACGTGGCAG ACGTGGTAAC TACGTCATCG ATAAnCAAGG CATCACGCGG CACACGAGCG 60 

TGCGCCCCGA GCTCTATACT CCCTGCAAGA TTCTCAACAC GCGCAGCGCG ATCTAAAGTT 120 

TTGCTGCGCG AAACGACCCT CTTACTCGCA CCAACGCACG ATTAACGGTA AAACCAGCCA 180 

ATTCTAGTCG ACGCGACACG TCCGCAAnCG GGTCCCATCC TCTTCTCAGC CATCATGCAn 240 



(2) INFORMATION FOR SEQ ID NO: 236: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 567 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 236: 

AAAAAAAAGG GnGGGGGGAA CCCAAAATTC CCCCCAAAAA nGGGTTTAAC CCAAAAAAGG 60 

GAAAAAAAAC CCAACCCAAA TTTAAATTTA AAAACCCTTC CCCCCTTTAA AAATTTTTTG 120 

GnAACCCAAT TTGGGAAnTT CCAAAGGGAA AAAAATTGGG AAAAACCCTT TTTAAAAACC 180 

CCCAACCAAA AACCCAACCC AAAATTTAAT TTAAGGTTTC CAAAAAAACC TTTTTTTGGG 240 

AGGGCCAATT TTTAAAAAAA CCCCCTTAAA AAAAAGGAAA AAAAGGAATT CCCCTTAAAA 300 

AAAATTTTTG GGCCAATGGG AnGAAAGAAA CCTGGCCTAG GnATTTAACC CCTnCCCAAA 360 

AnGGGATTCC TCCCCAAATG GAAGAACCTG GGGCCAnGCC TGGAATTTTC CTCCATCCnG 420 

ACCACCCTnC CCnGAnCTAG GGGGGAAGAA ATGGAAAACC AnCATGGTTT AAAAAAAAAA 480 

TCCCTTGTCC AATCCCAGAA ATACCGGTAA CCCCAGTTAG AnGCCTCCTC CATTTAATTA 540 

AAATGGAAGG GTGGAAATTT AAAAAAA 567 



(2) INFORMATION FOR SEQ ID NO: 237: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 215 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
<D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 237: 
CTCCATCCAG TCTATTAnTT GTTGCCGGGA AGCTAnAGTA AGTAGTTCGC CAGTTAATAG 60 
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TTTGCGCAAC GTTGTTGCCA TTGCTACAGG CATCGTGGTG TCACGCTCGT CGTTTGGTAT 120 



GGCTTCATTT CAGCTCCGGT TCCCAACGAT CAAGGCGATT ACATGnTCCC CCAGTTGnGT 180 
TGAAATAGTA ATCAGCAGGT TTTCGGGGCG AGTAT 215 
(2) INFORMATION FOR SEQ ID NO: 238: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 372 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23 8: 

ACTGACTTTG CTACTAATCA GACCTACATT TAACTCAGAT TAATAATGCT TGCTTAAGTG 60 

TCGCTCCCCC TCTTCGTGGA GGAACGACAC TAAATCCTGC CTAGGCTTCA TATCCGAGTC 120 

ACGGCACCAT TATGTCGCTC CCCCTCTTCG TGGAGGAACG ACACAGGACC CTGCGCTGTT 180 

CTTTCGTCTG CTCGGCCCTC CCCGGGTTTG CTGCTGGTTC TTCCCGGGTT GGCTGCTATC 240 

CCTTCCACCT CCGTGGAAAG GGCAGTTCCC CCTGGCCGCA TCCCCATTTC CGCAGGGAGC 300 

GGCAAACCGC GGCCGGCTCT TCTCGGGGCT GCACAnATGT TTCCCTTAAA AGTTCCCCAA 360 



AAAnGTTTCT GG 

(2) INFORMATION FOR SEQ ID NO: 239: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 150 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



372 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 239: 
GCATGCCTGC AGGTCGACTC TAGAGGATCC CGGGCATATA AACAACGCTG CCCCCGTTCC 60 
TGAAGCAGGA AGCGTACGAT AGTGAGCCAG CGGAAAGAGA AGTGCTGCAA GGCGGACCGC 120 
AAGCTCTGCA GGCACCCGAT CGCACGCAGC 150 
(2) INFORMATION FOR SEQ ID NO: 240: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 150 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 240: 



GCATGCCTGC AGGTCGACTC TAGAGGATCC CGGGGAGACG CTGAGCGCTC TCCTCGCCCA 



60 



CGAAAGACAC CGTGnGnGCC CGGTCCCTAG AACGGACGGT CCGCAAGGTA CTTGTACTCT 



120 



TGCCACTGTC TCCGCGCCTT CTCTTCTGCG 



150 



(2) INFORMATION FOR SEQ ID NO: 241: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 311 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 241: 

GAGGGAGGGG AGGAGGGAGA GAGAGAGAGA AAGAGAGAGA GGTCCTCTGT CCACTGCTTC 60 

GTTCCCCAGA TGGCCACAAC GGGCCAGAGC TGAGTCGATC CGAAGTCAGG CGCCAGGAGC 120 

TTCTTCCGGG TTTCACACTT GGGTGCAGGG TCCCAAGGAT CTGGGACATC TTCTGCTGCC 180 

CTCCCAGSGCC ATAGCAGAGA GCTATnAGAA GCAGCCAGGT ACTAGAACTG GTGCTCATAT 240 

GGTATGCTGG CACTGCAGAC CAnAGCTTTA ACCCACTCTG CnACAGTGCC AGCCCTGAAT 300 

GTTTTTGAAT A 311 
(2) INFORMATION FOR SEQ ID NO: 242: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 150 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 242: 
TCTTGCATGC CTGCAGGTCG ACTCTAGAGG ATCCCCTCTG GGGTAGAAGG TTT TGGCT AT 60 
GACCCGATTT TCCTGTTGCC ACACCTGGGC AGGACGTTCG CTCAGCTCAG CATTGAGGAG 120 
AAGAACCGCG TCTCTCACCG GGCACTTGCG 150 
(2) INFORMATION FOR SEQ ID NO: 243: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 596 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
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(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 243: 

GAAAATATCG GAGAGAGAGA CTAGCAAACA GCC TAGGGAA AAGCCGGACG AAAAAGGAGC 60 

CGGAAGAAGC TATTGAAAGC CTAGGCATAG ACTTGGATAC GGACTACGGG GGGAAGTTGG 120 

GAGAAATCTC TAAGGTCGAA AGCGAAAGTG AAAGCTAGAA CAAACAGATT CGGACGCGGA 180 

CTGTGGGGAG AGGC C AGGAG AAATGAGGGA GGAATATCGT TGGAGATAGC TTGGGGAAAC 240 

ATACCGGGTA GAGAAAACTG TTAGGGAAAT TGAAGCCGCG GGGGGCAGGC CAAGGCGGAA 300 

ACGAAAGCCA CTTTGGGGTT CTCAGGTTAG CCCGGGAATA GGGGGCAAAA AGTTGAAACC 360 

AGAAGCTGAG ACGTAAGCCA GATTGGGATC CGTCTGATTA GCCCGGGGAG CAAAGGACGG 420 

GAAGCCAAAT CGTGGGGCGG AGACGTACGC TGGGTTGAAT TCGCCAGGCT AGCCCGGGGA 480 

ACTTGGATTG AATGCTAGTG GTGGAGACGC AAGCTACGCT GTGTTACTCG CGGAAGCCGC 540 

CGCGTGCAGA GAGAGCACGG GGCGTGAGTA GATAGGGAAC GGGGCTGGCG TAnGCC 596 



(2) INFORMATION FOR SEQ ID NO: 244: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 150 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 244: 

ATGCnTnGCA TGCCTGCCGG TCGACTCTAG AGGATCCCCG TCACTTGCCC CCAGCTCCAA 60 

ACATTGCATC GTGACCCGTG CACCTTCTTT TGCAATGCTA GAGAGAATGA TTACTGGAAT 120 

ATCAATGCGT AGACGTTTCC GCTGTTCAAG 150 



(2) INFORMATION FOR SEQ ID NO: 245: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 489 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24 5: 
TGTTCTGGGC AGGGTTTTCT GAGTAAGAGC CCCAAAACAC AGGCAGAGAT ATTCAAAGTT 60 
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TCACATTTGG GTTAACTGGA TCATATTTTT GCACGTTCAT GGCTGAAACA AGGTCTTAAC 

AAAACTCAAA ATTGACAAAT GAAATCATGT CACATTAAAA TGCTTCTGTA CAAAAGACAA 

GGTTTTATTT GTTTGTATGT TTTTATATAC CTGACTCTGA AAACCTTATG CnGGGGCTGG 

TGCTGTGGTG TAGCAGGTAA AGCCGCTGCC TGCnGTGCCG GCATCCCATA TGGGGGCCGA 

TTTGAATCCG GCTGTTCCAC TTCTGATCCA GCTCTCTGTT ATGGCCGGGA AAGCAGTAGA 

AGAGGGCCCA AGCCCTGGGT CCCTGCATCC ACTTGGCAAG ACCCGGAAGA AGCTCCTGGC 

TCCTGCCTTG GACAGGCGCA CTCCTGCTAA GCGGCCAACT AnGGAGTGAA CCAACAGATG 
GAAGACCTC 

(2) INFORMATION FOR SEQ ID NO: 246: 

( i ) SEQUENCE CHARACTER I STICS : 

(A) LENGTH: 172 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 246: 
CTAAAGTTnC CCCGGGGGCT AAGCCCTTGG CCGAAATTnC AACCCCAGCG TACGTCTCCC 
GCCCCCACGG TTTGGCTTTC CCGTCCTTTG CTCCCCGGGC TAATCAGACG GATCCCAACC 
TGGCTTGCGT CTCAGCTTCT AGTTTCAACT TTTCGCCCCC TATTCCCGGG GC 
(2) INFORMATION FOR SEQ ID NO: 247: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 617 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 247: 
ATCAGAAGTG GAGCAGCCGG GTCTGGAACC GGCAC CCATA TGGGATGCCG GCACTTCAGG 
CCAGGGCGTT ACGCnTGGCA CCACAGCGCC TGCCCTGAAA CCTTTTCTTT TACACAAAAT 
GCAGATGGCT AATACTTCCA CTAACAATGT CCAGTATCAG GTTCAGCTAT GGTTTCTCTA 
GCTGGGTGTG ATACTTCCTT ATTTTTACTT GAAAAGCACA GTGACAAAGA GAGAGGGAAA 
GACACACATG GCTGGGCCAA GAGGAAGCCA GGAACCAAGA ACTCCACCCA GGTCTCTAAC 
GTGGATGGCA GGGCCCCAAG TATTTGGGCC ATCCTGCACT GCTTTCCCAG GAACATTAAC 
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AGAGAGCTGG ATTGGAAGCA GAGCAGTCAG GATTCGAACC TGCACTCTGA TATGGAAGGC 



420 



TGGCATCGCA GGTGGCAACT TAGCCTGATG GACAACAATG CTGGCCTTGT GATGTTTATT 



480 



TTTATGATTT TCTACAGCAG AAACAGCAGT TCCCAAATGC AGATATTTCC AAGCCTGCAT 



540 



AGACTCATAC TTCCTTTCAG GTAGCAGTGA CTGAGAATAG AATCTGCAAT CCCAGTGTTA 



600 



TCAACATTAC ATTCTAG 



617 



(2) INFORMATION FOR SEQ ID NO: 248: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 170 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 248: 
CCCCGGGCCC AATTTAAACC GGGCCCCnTT CCTTTCCCCC AAAAAAATTT GGCCAAACCC 60 
CGGGAAGGnT TAAAACCCTT TTAAATTTTG GTTGGGCCTT TTTTGGGGGn CCAnTTAAAA 120 
AACCTTTCCC CAAACCGGGG GAACCTTCCA AACCCTTTTC CTTCCCCCTT 170 
(2) INFORMATION FOR SEQ ID NO: 249: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 585 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 249: 

GAGCATTTTG AGAGGCAGCA AGTGATGGCT CATGTAACAG GTTTCTGGCC ACCTGTGTGG 60 

AGGAGCTGGA TTGAGTTTAT GGCTGCTGGT TTTGATCAGG GCCAGCCCTC ACCATTCTGA 120 

CCATCAGCAA AATGAACCTG TAGCTGAATG CTTCTCTCTC TCTCTCTCTC TCTCTCTCTC 180 

CCCCCAACCC CATCTCTCTC TCCCCGGTCT CTTTCTCTGT CTCTACCTTT CAGATGAATT 240 

TTTTTTAAAA AAATTAGTAT TTTGATGCAA AATTGTTTGA CATCTCTGAC CTTTTCATAA 300 

TACACCTTCT CCATTATCTT TTTGAGGACT GCTTTAAGCA TAGATTTGTA TGTAGATATA 360 

GATGTCTTTC GTCTTTTTTA AAAAAGATTT ATTTATTTGT TTTGAAAGTC AAAGTAACAG 420 

AAAGAGAGAG AGAGAGAGAG CTCTTCCGTT AGCTTGGTCA CTCCCCAGAT GGCCTAACAG 480 

CCAGCACTGG GCCAGGCGCC GGGTCTCCCA CACAGATGGC AGGGACCCAA ACACTTGTGT 540 
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CAACTTCTGA TGCTTTCCCA GGCCATTAGC AAGGAGGTGT ATTAG 
(2) INFORMATION FOR SEQ ID NO: 250: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 566 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 250: 
AACCAGGACA GAATCCGGCG CCCCGACTGG GACTAGAACC CGGTGTGCCG GCGCCGCAGT 
GGAGGATTAG CCTAGTGAGC CGCAGCGCCG GCCAGGAATA AAGTTAATTA AGTAGGTGAA 
AGACTTGTAC ACTGAAAATA CAAAACAAAT TAGAGAGTAC AAAAATAAGT GGAAAGACAT 
TCCAGGTTCA TGGATTAGAA GGTTTAACAT TATTAAAATG TAGTTTAAGG GGACAGCATT 
GTGGCACAGC AAGTTAAGTC ACCGCTTCCA ATGCCAGCAT CTCATATCAG AGTGCTGGTT 
TGAGTCCCAG CTGCTCCTCT TATGAACCAA CTTCCTGCCA ATGCACTGGA AAAGCAGCAT 
ATGATGGGCC CTACCACCCA TGTGGGAAAC CCAGTTGAAG CTCCTGGCTT TTGGTCTGGG 
CCTGGCCCAG CCCTGGCAGT TGAGACCATC TGGGGAGTGA ACCCATGGAA GATCTGTGTG 
TGTGTGTGTG TGTGTGTGCG TGACTGTGCA TGGAAGATCT GTGTGTGTGT GTGTGTGACT 
CTGCCTTCAA AATAAATTAA GAACCG 
(2) INFORMATION FOR SEQ ID NO: 251: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 441 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:. 251: 
AGGAGGTTAA GACCTAGTGA ATGCGTGGAG CTTATGAACT GGAACTGTGA AAAAAAAAAA 
AAAACTGAGG ATGTGTGGGA GAACTCAGGG TGTGCCTGAG AAGTGAGTAC TCTCCGTGGG 
AGACACCACA AACTTGGTAC CCTTGGCTAC C C AG TGAG AG CCATTGCAGG GGAATCTGAG 
CTTACACTGA GGACTGAACA GATCCTTTGT GTGGTCCTTG GGACAGAGCA GAGGAATATT 
ATACACACTG GGGCTAGCGC CCAGGCACTG ATTGCCATCA AGGAGAAAAG CTCAGCTGAG 
CAAAATTACT TCCCTTCTGA ACACAAAAAG AGAGAGAGAA GTTTACTATG CCTAACCTGG 
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GTGTGTCACC TTTGGGCACA CCCTTAACCC TGAAGAACTG AGCCGAGCTC TCTGGnCCAA 
ACCCGTCAAA AGCCTCTAGn G 
(2) INFORMATION FOR SEQ ID NO: 252: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 486 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 252: 
ATGCCCAAGT CCTTGGGTCC CCGCACCTGC GTGGGAGACT GGAAAGAAGC TCCTGGCTCC 
TGCCTTCGGA TCAGCGCACT CCTTCTGTTG CGGCCATTCG GGGAGTGAAC CAACGGATGA 
AAGACCTCTC TCTCTTTCTC TGCCTCCTGC CTCCTGGnCC TGCCCCTGCC CCTTCCCCTT 
nCCnTACCCC TACCTCTATC TCTACTTCTC TGTAAGACTC TTCATTTCAA ATACATAAAT 
AAGTCTTAAA AAAAAAAAAG CCAAAGTTTT CTACAGTTTC ATTGGTTCCT GGGAAAAGAT 
GCCACCACAG TGATTTGCCT CCCAGCTGTG AGCATTCCTC CTTACCCTTA TCGGACCCAT 
CAGGATGCCT GGTCCAAGTC GCCCACCGTG CATAGGCATA CAGTGGATCT TGGGTGCCTG 
CTTCTGTGCA CATCCAATCT ATCTTCCTGA CCTCTGGCCC AGAATTATGG TCCTTGATCC 
TCCATG 

(2) INFORMATION FOR SEQ ID NO: 253: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 47 8 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 253: 
nGGAACCCnT GATAAGCAAG TGGCCAGGTG AAGGTGGAAA AATCAAAAAA ATATGGCATG 
ATATTATAGA AGCCAAGTAA AGAACATGGT TATATAGAAG GAAGTGACCA GCTATCTCAG 
AAACTGCTAG CTAAGTCATG TACAATGAGA ACTGAGGGAT AATACTTATA AAATGAGAAG 
GTAGAAGGAA TATGAAAATT GTCTACCAAC CTCTACCCAA AGCTATACCA CTTTCCAGGC 
ACCCTTGAGA GATCTTCCAC CATGTCTATA CACACAGATT TACTTGTAAT GTTAGTAGTA 
GTTAAGTCAT TTGCATTTTG GAGCTTTATA TGCCCATGGT TTATAAACAG AAACAGAAGT 
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TATTAAAATT TTAGAAAGCA TAGGAATACT GAGGATCAGT CTCCCAGTAC TACTATTTTA 
AGATTTTATT TATTTATTTG GAAAGAGTTA CACAGAGAGA GGAGAGGCAG AGAGAGAG 
(2) INFORMATION FOR SEQ ID NO: 254: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 464 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 254: 
AATGCTGGAG AGGATGTGGT GAAAAAGGTA CCCTAATCCA CTATTGGTGG GAATGTTAAC 
TGGTAAAACC ACTATGGAAA TCAGTTTGGA GATACCTCAG AAATCTGAAT ATAGACCTAC 
CACATGATCC AGCCATTGCA CTCCTGGGAA TTTACCCAAA GGAAATAAAT CAGCAAAGTA 
AGGAGCTATC TGCACCCCCG TGTGTATTGC AGCTCAATTC ACGATAGCTA AGACATGGAA 
TCAACCTAAA TGCTCATCAA CTAAGACTGG ATAACGAAAT TATGGGATAT GTACTCTATG 
GAACACTACA CAGTGGTAAA AAAATGAAAT CCAGTCATTT GCAACAAAAT GGATGAATTT 
GTAAAACATC AT AC TT AG T A CGATAAGCCA GTCCCAAAGG GACAAGTACC ACCTGTTCTT 
CCTGATCTGT GATAAGTAAT AGAGCACCTA AAAGAAAATC TGTA 
(2) INFORMATION FOR SEQ ID NO: 255: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 66 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 255: 
CCTTTTTACT TGTTGAACTC TTTATTTAGT GGAGCGTTAA GCCTGTGATG CTAAAATAAA 
TTAAAAATAT GTTATTGCCA AAATTAAAGG GAGAAGGGAG ACTGGGAnGG TGAGAAGAGT 
GGAACTAAGT ATCAAATTCT TAGGACTGTA TATATGAACT ACTTGAAAAC TGTTCTCTTT 
ATATTAATAA AAATTTAACA TAAAAGCACT GAAAAAACTA GTATATTTAA ATCCTCTACA 
AAATCAATTG CTATGTATTT CTACCTTCAA ACCCATAAAT ACTTGCTTTG TGTGTGTGTG 
CAGTGTGTGT GCATGTACAT ACCTAGACAC AAAAAAACTG TATGTGGGGC TGGCGCnTGG 
CACGCTGGGT TAATCCTCTA CCTGCGGCAC CGGCATCCTC TATGGGCTCC GAnTCTAGTC 
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CCGGnTGCTC CTCTTCCATC CAGCTC TCTG CGTGGGCCCA GAAAGG 466 

(2) INFORMATION FOR SEQ ID NO: 256: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 451 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 256: 

CAAAGTCACT ACTTATCAAT AGTCATTTTG AATATAAATG GCCTCAACTC TCCAATTAAA 60 

AGACGCAGAC TGGCTGAATG GATTAAAAAA CAAACCCATC TACTTGCTGC TAACAAGAAC 120 

ACATCTTTCA ACAAAGGTGC ATGCAGACTG AAAGTGAAAG GTTGGAGAAA GATATTCCAT 180 

GCCAACAGAA ACCAAAAAAG AACTGnCATA GCCATCTTAA TATCAGACAA AATAGACATT 240 

AACACAAAAA CTGTTAAGAG AGACAAAGAG GGGCACTATA TAATGATTAA GCGATCAATT 300 

CAATAGGAAG ATGTAACTAT TATAAACATA TATGCAACCA ATTACAGGGT ACCGGCAGTG 360 

CAAAATAAAT GTTAATGGAC CTGAAAGGAA ACAAAACTCC AATACAATAG TAAAGAGGGA 420 



CTTCAATAGT CCACTTTCAG CAnGGACAGA T 
(2) INFORMATION FOR SEQ ID NO: 257: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 638 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



451 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 257: 

CAGAACCATG GGAAACCACT ATGAAAGAAG GTGAAGGCAG AATATTTCCC AGTAAAAGTA 60 

CAAAGGAAAC TCAAAGTAAA CTATAGGAAT ATCTGTGGGA AAATGGTAGG GCCAAGTTGT 120 

TACTTATCAA TAATCACCTT GAATGTAAAT GGCCTCAATT CTCCAATTAA TAGATACAGA 180 

CTTGCTGAAT GGATTAAAAA ACAAAGCTCA TCTATTTTCT GCCTTCAAGA AACGAATCTC 240 

ACCAACAAAG GTACATGCAA ACTGAAAGCG AAAAGATGGA AAAAGATATT CCATGCCAAC 300 

AGAAACCAAA AAAGAGCTAG TGTAGCTATC CTAACAGCAA AACAAAATAG ACTTTAACAC 3 60 

AAAAACTGTT TGAAGAGATA AAGAAAGGCT TATGCAATGA TTAACGGATC AGnTTACCGG 420 

GAGATGTGAC TATnTAAnGT ATAnGCACAC nTTACAGTAT ACTGAAAACT GCTCCTGATG 480 
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AnCATGGTTC ATAGAGAnAT C AnAAG AG AA ATAAGAAATA AAAAACTGTT GAGGGACTTT 540 
AACAAAATAA GAAGTATGAC TCTGTGGATT ACTGTTATnA C T ATT ATGAT CATAAACCTG 600 
CTCTGATAAG CGCTTTnCAT AT ATC AC TTC TATCTTTA 638 
(2) INFORMATION FOR SEQ ID NO: 258: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 406 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 258: 

AGTCTCTnTC TCTCTCACTG TCCACTCTGC CTGTCAAAAA ATAAAAATAA AAACAAAAAA 60 

TAAAAAAAGT ATCTCTTGTG ATTGAAATTT GTGTTTTTTT TTATTTAACT C ATTTG CAT A 120 

TCTTTATCAA CATTTTGGCC ATCTGTTTTA TTCTCTGAAA TATCTTTTTG TGAAGTCTAT 180 

TTAATTTTTT TCTGAAGATT TACTTGTTTA TTTGAAAGGC AGAGTTACAG AGAGGGAGGG 240 

TGAGACAGAA AGAGCTGAGA GAGAGAGTGT GAGAGAGAGA TGGATCTTCC ATCTACTAGC 300 

TCCTTCCCTA AATGGCTATA ATGGCAAGGA CTGGGGCAAG TTTAAGCTAG GAG C C AGAAA 360 

CTCCATGCAA GTCTCCCATG TGGGTGGCnG GGnCCATGTA CTGGGG 406 



(2) INFORMATION FOR SEQ ID NO: 259: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 443 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 259: 

AGAGAGCTGG ATCAGAAGTG GTGCAGTTGG GACTTGAACC AGTGCTCATA TGGGATGCCT 60 

GTGTCTCAGG GTTAACCTGT ACCACAATGC CTACCCTCAT TATACCACTA ATAATCTGGA 120 

ATCAGATTCA TTATTGCTTC AACTCCTGGT AAGGAGTATA GATCAGTAAA GGTTCTTAGA 180 

GTAGATTAAG ACAAGCTGCC TAAAAAAGCA AAGGTTTGAA GTAATAAGCT CTTGGAGAAA 240 

ATAGATGTTG TCGAAGGGAA TGCTCGAGTG ATTCTAATAA ACCGTCATTC CCTGTGATTG 300 

CTTTACATGG CGTGCAGTGC ATTTGGAACA AGACAGACGC CAAGTCAAAT CCTATTCCTG 360 

GCTATATGAT GTTGACTGGA GATAGTTCCC ATCTCTGAGA CCCAGTTCTT ACTGGGTAGA 420 
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(2) INFORMATION FOR SEQ ID NO: 260: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 656 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 260: 

AGGTCTTTAA TCCATGCTGA GTGGATTTTT GTGTAAGGTG CAAGGTAGGG GTCTTGCTTC 60 

ATGCTTCTGC ACGTGGAAAT CCAGTTTTCC CAGCACATGG AACCCCAAAC CCCCTTAAGA 120 

TGGTATTTTT ACCACCCAGC CTAAGTGTTA AAGTGATCAT ATGGATAGGA TTGAGTGTCT 180 

GGTAATAATA ATAGATAGAA TTTAAAAGGA GTGAATGCTT CAACATGGGA AGCAGTCCAC 240 

ACAGCAGACT CATAATTGCT TTAAAAAGCA CTCTGACCTC AGAATCAGCC CTTAAGGCAT 300 

TCTGGTCTGG CTGAAAAGTC CACGAGAGCA TTCAGACATG GAAAGCCAAG ATATTGTGAC 360 

AAAAATGTCC TACACGAAGG ACTTAGATGG TGGAAAGAAG TGTCCATTAA AGAAGGAGGC 420 

ATTTTCTCTA AAGAGAGGAG AGAACTTCAA CTTTGCTTAT GACCTTGTCT AACTACGGAA 480 

TGAGTTTGTG GATTCAGAAG GCTTCCATAA CCTTGGTACC TCATGTCAAG AGCCTCAGAT 540 

GATCACTGAC ATCATACTTA AGAATGTTAA TTGTTGGGGC TGGTGCTGTG GGCACAGCAG 600 

GTTAAAGCCC TGGCCTGAAG AACTGGGCAT CCCCATATTG GGCACCAGTT CTAGTT 65 6 



(2) INFORMATION FOR SEQ ID NO: 261: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 480 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 261: 

ATCAGTGCCT TGTAATTAGG TGCATATACA TCTATAATAG TTACATCTTC CTGTTGACTT 60 

GATCCCTTAG TCATTATATA GTATTCCTCT CTGTCTCCCT TAACTGTTTT TGTGTTAAAG 120 

TTTATTTTAT CTGATATTAA ATGGCTATGC CTGCTCTTTT TTCATTTCTG TTTGCATGTA 180 

ATATCTTTTT CCAAACTTTC ACTTTCAGTC TGCATGCATC TTTGTTGGAA AGATGCATTT 240 

CTTGTAAGCA GCAAATAGAT GGGTTTTGTT CCTTAATCTA CTCAGCCATT CTGTGTCTTT 300 
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TAACTGGACA GTTGAGGCCA TTAACATTCG ATATGACTGT TGATAAGTAG TGACTTGCCC 
TGCCCTTTCC CAAAGATATT CTAATATATG CTTGAACTCC GTGATCTTTA CGTGAGGTTT 
TCTCCTTACC TCTTCATATG AGGCCAGTTT CGTGTGTAAC ACATATTATG CATTTTTGCA 
(2) INFORMATION FOR SEQ ID NO: 262: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 262: 
CnAAATTCAA AAACCCTGGA GGATGACATA TATTAACCAC TTATTCCAGT AGTAGCACAT 
GTACGGCTAT TATAAATATA GATGTATTGG GGTAGCTATT GCTTACTTAA CTATTTAAAT 
AGTCATTGGC TTAGCTTGTA TAGGTCAGGC TCAGCCGAGT AGTCCTGGCT TTCACTGGGC 
TACCTGGTGA ATCTTGAACC CGTGATATAC CAGGGAGGTG TCTCTGCTTC AGAGTATGGC 
TGGTTGTTGC CTGGGACAGT GGAACCAATG GCCCAAATGT CTCTCATCTC CAAAAATAGC 
CCAAGCTTTT TCACATAGTG TTTCCAACGA TCCAACAGGA AGAAAAGCAG GCAAGGCCTG 
GAGACTTAGG CTCAGAACCA GCTTACCTTC ATTTCTGCTG TGTTCCATTC ACAAAAGCAA 
ATCACAAAGC CAGCCCACAT TGAAGGGGTG AGAAATAATT TTTTTTTTGA CAGGCAGAGT 
TAGATAGTGA GAGAAAGAGA CAGAGAGAAA GGTCTTCCTT TTCCAATGTT TCACCCCC 
(2) INFORMATION FOR SEQ ID NO: 263: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 681 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 263: 
GCnCTTCCAG AAAAATCTTA TTT AC T AC TT TGGCCCTAGA AATCATAAAT GACTATGATA 
GAATGTAACT TTTAACAGAG GTGAGGAAGG CATGGTTTAA TATCCAGTTT TGAGAATACA 
ATTTTCATTT GTATTTTATT GATGTTTATA GTTCTTTTAA GATTTATTGA TTTATTTGGA 
AGTCAGAGTT ACAGAGAAGG AGAGGCACAC ACACAGAGAG AGAGAGGTGT TTTCCATCTG 
CTGGTTCACT CCCCAATTGG CAGCAGCACC ACAGCACCGA CCCCTTATAG TTCTTTTCAA 
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ATCATTTTAC ACTGTCTTTT TATTCATCTA GTACAGAGAC AGAAGAATGG ACACAGAGCT 
AACATAGCTG AGTATTCAAG GAGACAGGTT AAGGGGTTAA ATGCCTCATA TCATATTATT 
TATACCTTAG ATTAATTCTG GGGACAGTAT TATCTGGAGT TTACAAAGTA GAAACTGAAT 
GGAAAGAGCT TAGAAAAATA CGGTGTTTTT TAATATCATT AAAAAGGCCA ATCAAGGGGC 
CAGCACTGTG AAATAGCAGG TAAAGCTACT GCCAGCATGG ACATTCCATA TGGGTGGAGG 
TTGAGCCCGG CTAGTCTGCT TTCAATCCAA CTCTCTGCTA TGGCCAGGGA AAGCAGTGGA 
AGATGGCCCA AGTCCTGGGC A 
(2) INFORMATION FOR SEQ ID NO: 264: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 653 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 264: 
TGGCTTCTAG CTTGGGATTG GCTCAGCTCC AACTGTGGCA GCCATCTGGG GAGTGAACCA 
GTGGATGGAA GACCTCTCTT TCTCTCTGCC TTTGCCTTTC TGTTAACTTT GCCTTTCAAA 
AATAAATAAA TAAAAAGAAT TGAATTAGAG AACATCTAGC TAGTGTCCAC T AT AG AAC TG 
AATATTCATT TAAAAAATGG TTAGAGAGTG TATAGGGCAA AAGAAAGGGC TGCTTTCTCA 
AGTACATCGT CAATTCCCAG GGAGGCTGGG ATCAGCTTGC CTTGGGGTTG GTCAGGAGAA 
TGTCAAGGAT TACAGCAGCT CTTAGAGCCT GGTTGTGAAG GGAAGTAATG GTGATCAAAT 
GAGAATTGTA CCAGTGAAGG TCAGCAGAAA GGACCAGCTC TGCTGCTGAT GGTGGGGATG 
TAAGAGGAGC TTGGAAGCTA CTGCAGTGAA TGTGAGCTTT GGAG AT TT AT GTATTTGAGC 
TGTACAACTA TGGGGAAGAC TTTTTTTTTT TATCnTGTAA GCTTCAATTT TTCAAACTGT 
GAAATGGGGC GAATAATTAT AGACATAAAT ACAGAGCAAC AGTTTTGAAA TACCATAAAC 
CTCATGTCCT TCAACCAGTG nTATTCATAC CnTATAGGGG TATTGTGAGT TCC 
(2) INFORMATION FOR SEQ ID NO: 265: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 459 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
<D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 65: 

TGGTGTTCAA AAGAGGAAAT CCAAATGGCC AACAGGCACA TGAAAAAATG TTCAGGATCA 60 

CTAGCAATCA GGGAAATGCA AATCAAAATC ACAATGAGCT TTTACCTCAC CCCGGTTAGA 120 

ATGGCTCACA TTCAGAAATC TACCAGCAAT AGATGCTGGC GAGGATGTGG GGGAAAAGAG 180 

ACACTAACCC ACTGTTGGTG GGAATGCAAA CTCGTCAAGC CACTGTGGAA GTCAGTCTGG 240 

AGATTCCTCA GAAACCTGAA GATAACCCTA CCATTCAACC CAGCCATCCC ACTCCTTGGA 300 

ATTTACC CAA AGGAAATGAA ATTGGCAAAC AAACAAGCTA TCTGCACATT AATGTTTATT 360 

GCAGCTCAAT TCACAATAGC TAAGACCTGG AACCAACCCA AATGGC CC AT CAACAGTAGA 420 

CTGGGATAAA AGAAATTATG GGACATGTAC TCTATAAAA 459 



(2) INFORMATION FOR SEQ ID NO: 266: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 707 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 266: 

GGAGTCTAAA AATATTATTT CATAGAAGTT AACAATATAA TGATGGAGAC CAGAGTTGTG 60 

GCACAGTGGG CAAAGCTGCC ACCTGTGACA CCAGCACCCC ATGTGGGCGC CAGTTCATGT 120 

CCCAGCTGCT GCACTTGCAG TCCAGCTCCC TGCCAATGGT TTGGGAAAGC AACAGAAGAT 180 

GGCCCAAGTG TTTGGGTCCC TGCCACCCAC GTGGGAGACC TGGGTCAAGC TCCTGGTTCC 240 

TGACTTTGGC CTGG CTCAGC ATTGGCCATT GCAGGTGTCT AGGAAGTGAA TCAGCAGATA 300 

GAAGATCTCT CTCTCTCTCT CTCTAACTCT TTCAAAATAA GTAAATAAAT ATTTTTAAAA 360 

TATATATGGT GGATATCAGA CGCTGGGGAG GGAAGTAGAG AGGGAGAGAT AGTGAAAGGT 420 

CTATGGTGGG TACAGCTGAA GAAAAGTGAG AAATTCTGAG GTTGTATTGC ACCATGTGAC 480 

AACAGATAAT GTGTGCCAAC AGATAATGTA CCATACAGTT CTACATTTAA GAAAAGAAAA 540 

CTAGAAGATA CAATTTTGGA TATTTTCACT ACAAAAGAAA ATGTTAACAA TTTAAGGAGA 600 

TAGATATATG TACCCCACTA GACATTTAAC AAGATAGACA TGTATCAAAA TGTCAAATGC 660 

TACCCAATAA ATATTTATAA ATTGTAAATG TCTGTTAAAT TTTAAAA 707 

(2) INFORMATION FOR SEQ ID NO: 267: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 639 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 267: 

TGAGAGGCAA AATGTAGAAA GAGAACAGAG AGAGAGAGAG AGAGAGAGAG AGAGCGAGCT 60 

TTCATCTGCT GGTTCACTCC CAATTGCAGA ACCAGCCAGA GACAGTATAG GCAGAATCCA 120 

GGAGCTGGGA ACTCTGTCAG GGTCTCTCAT GTGGGTGGTA AGGGCCCAGA TACTTGGGCC 180 

ATCTTCAGTT GCCTTCCCAC ACGCATTAGC AAGGAGCTGG ATCAGGACTT GAACCAGCAT 240 

TCTGATATGG GATTCTGACA TTACACACAG CAGCTTAACC CACTATGCCA CAGTGGCGGC 300 

CCTTGCCCTC ATCCTTAATA ACCTATATAC TACATCTGCC TGACTACTAT CAAAGTGGCA 3 60 

GAACTTGGGA GCATCTCAAG ACATAGGAAT GGTGTAAGAA TTTTACTAGT GGGGCTGGTG 420 

TTGTTGCACA GTGAGTTAAG CCGCTGCCTG CAATGCCGGA CTTCCCATAC GGGTGCCAGT 480 

TCAAGTCCTG GCTGGCTCCA CTTCTGATCC AGCTCCCTAC TAATGCACCT GGGAAAGCAG 540 

CAAAAGACAG TGCAAGTGCT TGGGCCGCTG TCACTCATGT AGGAGACCTG GGTGAAGCTn 600 

CCTGGGCTCC TGGGGCnTTC AGCCTGGCCC AGTTCCTGG 639 



(2) INFORMATION FOR SEQ ID NO: 268: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 550 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 268: 

TGCCTTTCAA ATAAATAAAA TAAACCTTGA AAAGAAAAAC TAATAGATTA TTAAAGGACG 60 

AAAGAAGCTT GAATACAATT GCTTTTTAGC AAGTGTTAGT TATTTTTGTG AGGACACGAT 120 

CCACCTCGGT GTCCTTGAGT GCAGCCTCTC AGGGCAGTCA CAAGTCTTCA GAATTGGTCT 180 

TAAAAGTCCT TTAGGAAGCT GCAGCGTTGG AGGGGGTCCA AAATCTGACA CCTCCAGGTT 240 

CTCCTCACTG ATGGGGAGAT CATGGCTTCT CAGCACAGCA CCAACTGGGG TTAGTGTTTT 300 

ATTTTGTGTT TAGATTTATG TACTTGACAG GCAGAGGGAG AGAGAGAGAG GAGAGAGAGA 360 

GAGAAAGAGA GGAAACCTTC ATCTGTTGAT TCACTCCCTA AATGTCCATA ACAACTGGGG 420 

CTGGACCAGT CCCAACCCAG GAGCCAGAAA CTCTCTCTGG ATCTCCCATG TGGACTGTAG 480 
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GGACCCAAGC ACTTGGGCCA TCAACTCCTG CCTTCCAGAT ACATCAGCAG GAAGCTGAAT 
CAAAAGTGCA 

(2) INFORMATION FOR SEQ ID NO: 269: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 541 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 69: 
GTTGGGGGAT CTGGGATGGA GCTCAAGAGT TCCTGGCTTC TGCTTGATTG AGCTCTGGCC 
ATTTTGGGAA CGAATCAGCA GATGGAAGAT CTCTTGCTCA C TCTTTT ATT TTTTCAAAGC 
TTTATTTTAT TTATTTAAAT GGAAGAGTTA GAACGCTCTT CCATCCACTG GATTCACTCC 
CCAAATGGCA GCAATGGCCA GCGCTGGGTC AGGCTGAAGC CAGGAATTTC TTCTGGATTT 
CCCACATGGC TGCAGAGGTC CAAGGACTTG GGCCATTCTC CACTGCTTTC TTGGGCACAT 
TAGCAGGGAG CTGGATCAGA ACTGGAGCAG CTGGGACTTG AACCAGTGCC ATATGGGATG 
CGGGCACTGT AGGCAGCAGC TTTACCTGCT ATGCCACTGC GCTGGCCCCA TTCTCTCTTT 
CTCTGTCTCT TTCATTCTCT CTCCCCATCC ACCATCTCTC TGTCACTTTG CCTTTGAATA 
TATGAAAGTG ATTTTTAAAA ATnAAAGTAA TTCTTAATAA TATCAGGAAA TGAATTCTAT 
A 

(2) INFORMATION FOR SEQ ID NO: 270: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 478 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 270: 
CGGTTTATTT AAAAGATATG TTAAGGGACT TAAAGGGAGA CTTAGACTCC AATACAATAG 
TACTGGGGGA CTTCAATACT CCACTCTCAG AAATAGACAG ATCAACTGGA CAGAAGATCA 
ACAAGGAGAC AGTAGATTTA AATGACACTA TAGCCCAAAT GGACCTAACA GATATCTACA 
GAACTTTTCA TCTGACATTT AAAGATTTTA CATTCTTCTC AGCAGTGCAT GGAACCTTCT 
CTAGGATTGA CCACATCCTA GGCCATAAAG CAAGTCTCAG CAAATTCAAA AGAATTAGAA 
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TCATACGATG CAGCTGCTCA GACCATAGCG GAATTAAGTT GGAAATTATC AACTCAGGGA ■ 360 
ATCCCTAAAG TACACAGAAA CACATGGAGA CTGGAACAAC ATGGCTCCTG GAATGAACAG 420 
TGGGTCATAG GAAGAAATCA AAAGAGAAAT CAAAAACTTT CTGGAAGTAA AGGAGGGT 478 



(2) INFORMATION FOR SEQ ID NO: 271: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 416 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 271: 

GAATTGAGCn CGAAACAGGT GTAnCCATAA GTTTTGATAT GTTGTCATGT CACATCATTA 60 

GTTTCCAGAA AATTTTTGAT TTCTATTTTG ATTTCTTCTA TTACCCAGTG TTCATTCAGG 120 

AACTTGTTAT TCATGTGTTT GCATATGCTC TAGATATTCC CGAGTTGCTG ATTTCCAGCT 180 

TTTTTCCACC ATGGTATGAG AAGCTGCATG GTATGATTCC AATTCTTTTG ACATTGTTGA 240 

GACTTGCTTT ATGGCCTAGT ATGTGGTCAA TCCTAGAAAA AGTTCCATGT AC TGCTG AG A 300 

AGAATCTGTA TTCTTCAAGT GTAGGAATAA AAGTTCTGTA GATATTAGAT CCATTGGGCT 3 60 

ACAGTGnTGA TTAAATCCCT GnTTCCTGnT GGATTTCGTC GGGGACCGTC CATGCT 416 



(2) INFORMATION FOR SEQ ID NO: 272: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 474 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 272: 

TTGTTCTTAC TGAAGTGCAT CAGTGGCCTC TACCCTGCTT CTGCTGCCTT CCGCCCTCCC 60 

TGGCCAACAA GCACTTGGAA GAGGAGCCCT GGATGAACGT TGTAAACAGA CCTCAAGACT 120 

CCCTTGGGAT GCTGGCATCC CAGTTCCTAG CGCCTGGGTT CCAGTCCTCT GCTTTCCTGT 180 

CCCTGCCGGC TAAGTGGGAG AACTGGATTG GGAGCAGGTG CAAGCATCTG GGGAGTGAAC 240 

CAGCAGAGAG GGGACCACGT TTACTCCTTT TCTCTCTGCC TCTCAGATAC ATGCAGATAT 300 

ATACAAGTTT AAAAGGAATG CTTCGGTTTT TGGCAAATTA TTTGTTAATA AAAATTTAAA 360 

TATTTCTGTC ACTTTTTAAA CATTTATTTA TTATTTGAGA GAGTTACGAG AGGGAGAGAC 420 
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AGAGAGAGGT CTCTATCTGC GGTTCACTCC CCAGGTGGGC TGCAAGGGCC AGGG 

(2) INFORMATION FOR SEQ ID NO: 273: 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 427 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 273: 
CGGACTGTGC CTGCATGGCC TGCATGTGAA CAGACTTGCC ATTGGCTGGG AGCGTCCTGC 
TAGGGGAACC TAGGGTATAT AATGGGGATA GGATTGTGTA GGGGGATGAG GGCTTGGCTT 
CTTTTCTTTC GCCTTGCATC TGGATGAATA AAGTTCCATG AGAACCGAGT AAGCAGCGAT 
CGTGTCGTTA CTATGCTGGA CTCTCGCGGG CAAGCGTCCG GCAGCCCCTA GACTCAAATG 
TTGTCAGAAG TTTGAGAACT GCCATCCTAA AGAATTTCTG ATGGGGCCAG TACGGTGGCA 
TAGCAGGTTA ACACAGTGTC TGCAGTGCTG GCATTCCATA TAGGCGCTGG TTCGAGTCCT 
GGCTGCTCCA CTTCCAATCC AGCTCTCTGG CTAATGGCCT GGGAAAGCAG CAGAGGGATG 
GCCCAGG 

(2) INFORMATION FOR SEQ ID NO: 274: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 616 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 274: 
CATCACCACC ACAAATGAAA ACTTAAGCAT GCACTGGTGA TCCACAGGAG AATGGAATTC 
CTCTCTCCTA TAGGCCTGAG CACCACCCTG CAGGAGGGGC TTTAAAAGGC AGATGCAGCA 
CATCTTTGAA TACCTTGTTT CTACTTGTGG TTGTGTGATT TTCTTTTTTT TTAAGATTTT 
CTTTATTTTA CTTGGGAGGT AGAGTTTGAG ACAGTAAGAG GGAGAGACGG AGAGAAATGT 
CTTCCCTATG TTGGTTCATT CCACAAATGG CTGCAATAGC TAGAGCTGCA CCAATCTGAA 
GCCAGGAGAC AGGCACCTCT TTCTGGTCTT CCAAATGAGG GCAGAGGCCC AAGGATCCAG 
GCCATCCTCC ACTGCTTTCC CAGGCCATAG CAGAGAGCTG GATTGGAAGT GGAGCAGCCG 
GGGCTAGAAT CAGCACCCAT ATGTGAACCT GTGCTGCAGG CAGAGGATTA ACCTACTGCA 
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ATGGTTGGCC ATTCCCTAAC ACCATGGCTG ATCTCTACTC AGGTTTGCAG TTTTAGGCAA 



600 



CATTTTGAGC TGCACA 



616 



(2) INFORMATION FOR SEQ ID NO: 275: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 484 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 275: 

TGTGATAnTC TGnACTTTCA AATAAATAAA TAAATGAATC TTTTTTTTTA AAAAAGTGGA 60 

TTAGGGTACC TTTTCCCCAC AGCCTTGCCA TCATTTATTG TTTTTTTGAT TTATATATGA 120 

TAGCCATTCT AACTAGGGGG AGGTGAAACC TCATTGTGGA TTTTATTTGC ATTTATTGAT 180 

GGCTAGTGAT CCTGAGCAGA TTTTAAATTT CACCCTTTGA AAAATGCCTG CTCATGTCCT 240 

TTTTCCTTTG CCTATTTCTT AACTGGATCG TTTGTTGCTG CTGAGTTTCT TGACTCTTTA 300 

TAGATTCTTG ACATCAATCC TTTATCAGTT GCATAGTTTG AAAACATTTT CTTTTGTTAT 360 

GTCAGTTGCC TCTTCAGTTT GTTGGGTGAT CCTTTACAGT GCAGAATCTT CTTAACTTGA 420 

TGTAATTCCA TGGTCTATTT TTGCCTTTAn TGCCAGTGTT ATGGGGTnTT TCCAAGAAGT 480 

CTTT 484 
(2) INFORMATION FOR SEQ ID NO: 276: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 586 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 276: 

CAGTGTTGCC TTTCAACTTT CCATCTTGTC CGTGTGAGGG GGTCCCTCAC AGGGGACGGT 60 

GGGCCCAGCC TGTCGTGTGC TCACTTGTCT GTGACATCAC GGATGGGAAC TGnGTGAAAT 120 

TTCAGCTCCC CAGATCTGAA ATTTCACAGA GCAGCCTTTG TCACCTTGTT AGAGAATGTT 180 

TTTCCATCTC AACTCAGCGG TAAACGGATC ATTTATACGC ACCTGTTTTC ATCTGGATGG 240 

TGAATTTACC TGGTGGGTGG AAATTGGATG TGAGATTCAC ACAGCCTCAG AGCGTCTGGT 300 
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CAGAGGTACT TACTGAGGCT GGCGCCATGG TGCAGTGGGC TAAGGCACCG CCTGTGACAC 3 60 

CGGCATGGCT GCTCCATTTG CAGTCCAGCT CCCTGCATAA AGCTCCTGGC TCCTGGCTTC 420 

AGCCTGGCCC AGCCTTGGTC ACTGTGGCTA TTTGGGGAAT GAACCAGCAA ATGGAAAGAG 480 

TCAGTCTCTC TCTCTCTCTC TCTCTCTCTC TCTCTCTCTC TCTCTCTGTA ATTCTGCCTT 540 

TCAAAAAAAA TTTAGGATGT ATGTATGTAT GTATTTATnT ATnTAT 586 
(2) INFORMATION FOR SEQ ID NO: 277: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 327 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 277: 

TCACTTTAAC ACTTAAAATG CTATTTTCCC TCTTAGCACA AGAGGACTTG GGGTGTCATG 60 

TCAAATTTTT AAACTATACC CTTAGAAATA AATCTGTGGT AATGTATACA GAATTATGCA 12 0 

GCTTTGCAGT TACAAACTTC ATACACTTCA TAATTATGAC TTTAGGAACA TGGTGATTCT 180 

TTCCACTCTG CCTGTCCTGC CACCCACATC CCCACCCCTC TTCCTCCTCC CTCTCTTATT 240 

CC CTCTTTTA TTTTTGACTA GGATATATTT TAATTTAACT T TAT AC AT AT ATGATTAACT 300 

CTATGTTAAG AGAAGAGTTC AGCAAAT 327 
(2) INFORMATION FOR SEQ ID NO: 278: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 481 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
<D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 278: 

TTCTAAAAAG ATTTTCTACT TCTACTTATA TACAAAAAAA TAAAATATTT CTTTATCTAA 60 

TTGACACAAC CCTCTGTCCT GCTTTTTTGT TTAAAAGATT TATTTATTTA TTTGAAAGAG 120 

AGTTAGAGAA AGGTAGACAG AGAGAGAGAG AGAATCACCA GGAACTTTTT CCAGGTCTCC 180 

TGCTAGAGAG TAGGGGTCCA AGGACTTGGG C CATC TT AC A CTGCTTTCCC AGACCAAAAG 240 

GAGACAGATG GGCTGGAAGT AGAGCAGCCA GATCTTGAAC TGGCACCCAT ATGAGATGCT 300 

GGCACTGCAG GCTGTGGCTT TGCCCGCTAA GCCACAGTTC AAGCCCCAAT ATGTCCGGCT 360 
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TTTTCGGATC ATATCAGTCT ATGACGTGCC ACTTATATTA CTATTAATCA ATGGCACCTC 
TTACTCTGAA ATGTGATTAT CTTGTATGAT AAATTATACA TAAGTTCTTA AAATAAGTGT 
C 

(2) INFORMATION FOR SEQ ID NO: 279: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 481 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 279: 
GCGTGAGGAG CAAGATCAGC CTGGAAAACG TGTGACAGCG TACCACTTTG AAACAGCTGA 
GTATAAAGCA ATAGAATTTC CTTGGAGCTA CACATGCCAA ACACTAAAAA AGCAAGTTCC 
CCAGGCCAAC TGTTGCCTTC TTCTTCTTCA TTTTTTTTTT TAAAGATTTA TTTATTTGGA 
AGGCAGAGTT ACGAGAGAGA CGGAAAAATA GAAAGAGATC TTCCATCTGC TGGTTCACTC 
CTCAAATGGC CATTAACAGC CAGAGCTGGG CCAGGTTGAA GCTGGGAGCC AAGAGGTCCA 
TCCCAGTCTC CCCCATGGGT GCAGGGGCCA AACACTTGGG CTATCCTCCT CTGCTTTTCC 
CAGGCCCTTT AGCAGGGAGC TGGATCAGAA TTGGGGCAnC CGGGAnTTAA ACCCAGGCCC 
ATGTGGGATG CCGGTGCTGT AGGTGGATGG CTAACTCACT GCACCACAAT GCCAGCCCCA 
A 

(2) INFORMATION FOR SEQ ID NO: 2 80: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 455 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 280: 
AACAGACATT CTTCGCAGAT CTAGAAAAAA TGATGCTGAA ATTCATATGG AGGCACAAGA 
GACCTCGAAT AGCTAAAGCA ATCTTGTACA ACAAAAACAA AGCCGGAGnA TCACAATACC 
AGACTTCAGG ACGTACTACA GGGCAGTTGT AATCAAAACA GCATGGTACT GGTACAGAAA 
CAGATGGATA GACCAATGGA ACAGAATTGA AACACCAGAA ATCAACCCAA ACATCTACAG 
CCAACTTATA TTTGATCAAG GATCTAAAAC TAATTCCTGG AGCAAGGACA GTCTATTCAA 
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T AAATGG TGC TGGGAAAACT GGATTTCCAC GTGCAGAATC ATGAAGCAAG ACCCCTACCT 
TACACCTTAC ACAAAAATCC ACTCAACGTG GATTAAAGAC CTAAATCTTC GTCCTGACAC 
CATTAAGGTT ATTAGAGGAA CATTGGGnGA AAnCC 
(2) INFORMATION FOR SEQ ID NO: 281: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 515 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 281: 
GGCACTGGGG AGTGAAATAG CACGTGGGCA CTGTCTTTCT GTCTCTCAAA CATGTGAACA 
CATTTTTTTA AAGAAGATGA CCAAAAATAG GTAAAGTCCT CTGTCGGACG TGTTCACACC 
ATTTAATTCA GTCTCTCGAT TGTTTTTACA AAAATAAAAG CCTTTTTTGT TTTACAACAG 
TTTTTTTTTT TAAGGATGTA TTTATTTGAA AGACAATTAG AGGGAGGTCT TCCATCTGCT 
GGTTCAGTCC CTAGATGGCC ACAGCGGC C A GGGCTGGGCC AGGCCAAAGC CAGGAACCGG 
GAGCTTCTTT TGGGTCTCTC AAATGTGTGG CAGGGCCAAG CAGTTGGGCC GTCTCCACTG 
CTCTCCCAGG CCGCTAGCAG GGAGCTGGGT CGGAAGCGGA CTGACGTTGC TGGCCTCGGC 
CTACCTGCTG GCACCGTAAG CTGGCTCCAG GACAGTTTGA TGGAGGTGCA GTCCAGCACA 
CTGTGTGTGT GTAAAAGTCA CACTTCCAGC ATACA 
(2) INFORMATION FOR SEQ ID NO: 282: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 585 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 282: 
GGAATGCCAG CCCGCCACTG CCGTCCGTCC TGGGTTGTTT CATACCCTAG CTCTTCTCCA 
CGTCGTGAAA GCCCAGGCGA GGAGACCTGG TGACTGTCCC GGGCACAGCA CTCGGTGAGG 
CGTTCCAAGG GCATCCCAGG GTGCAGGCGT GGGCTGCCAC GTTCTCGCCC CACTCCACGT 
GACCGCTTTT GGCCCTCAGG GACAGGGCGA GGATGTTGGC TGGCCCTGGC CGCCCTTCAT 
AGGGGTGCTC CTCAGAACCT GAGGGAGAAA TTCTTTTTCT CTGAGATTTA TTTATTTATT 
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TGAAAGAGAC AGAGATCTTT CATCTACTGG TTTACTCCCC AAATGGCCTC AACAGTCAGG 
GCTGGGCCAG GCCAAAGTCA GGAGCCAGGA ACTCCATCCA GGTCTCCCAC ACAGATGGCA 
GGGACCAAAG TACTTGGGCC ATCCTCTGCT GCCTTCCCAG GCGCATTAGC GTGGAGCTGG 
ATCAGAAGCA GGAAAGCCGG GATTCAGCTG GCCTCCAACG TGGGATGTGG GACAGAGCCC 
ACCCCTGGCA GGTTTTTTTT TTTnnGnnTT TTTTTTATAT TTnAT 
(2) INFORMATION FOR SEQ ID NO: 283: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 636 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 283: 
ACTCTGCCTG TCAAAAAAAA AAAAAAATCT TAAGGGGGCT CAGCTCAGCT GTGACTCCGT 
GATCCGCTAA TATGAGGTCT ATTTCCAGTC CCTCCTCCTC CATTTCCAAT CCAGCTCCTT 
GCTAATTCAC CTGGAAAGCC ACGGGAAGAT TGACCCACTA CTTGGACCAG CTGGGAGATG 
TGGAACAGAT GAAGCTCCCG GCACCTGCCT CCTGGATTTG CCCTGGCCAC CCAGAGCCCA 
TCCAGCAGAT GGAAGATCTC TCTTCCTCTC CCAACTCTCA GCCAGTCACC CCACAACTCT 
TTCACATAAA TAAGAGTACA TTAAATTTAA AAGAGAATTG GCCAACTAAG TCTCTGAAGG 
TGGGGGGGTT GGGCTGACCC TGGGGCATAG TAGGTTAAGC ATCTATCTGT GGCTCCAGTT 
TGAGATGGAA GGACTCTCTG TAACTCTGAC TCTCCAGAAA AAAAAAAAAA GATAAAAATC 
TTAAAAAGAA TTATTATTAT TATTATTATT ATTATTATTA TGGGGCCTGT GCTGTGGTGT 
AGCAGGTAAA ACTGCCACAA GCAGTGCCGG CATC C CAT AT GGGCTCCCAT TCGGAATCCC 
ATCCCCAGCT GGGCTGCTCT CTGCTAnGGT CTGGCA 
(2) INFORMATION FOR SEQ ID NO: 284: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 656 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 284: 
TAGAGATTTA GTAATCTTCT GAACTCTTTG CTTTCAACCA TTTTCAGTTG CCAAAGATCT 
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